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TITLE : Multiple Marker Assay for Detection of Ovarian Cancer 
FIELD OF THE INVENTION 

The invention relates to compositions, kits, and methods for detecting, characterizing, preventing, 
and treating ovarian cancer. 
5 BACKGROUND OF THE INVENTION 

Epithelial ovarian carcinoma is the most common and most lethal of all gynecologic malignancies. 
Only 30% of ovarian tumors are diagnosed at an early stage (Stage I/II), when survival rates reach 90%. The 
rest are diagnosed at an advanced stage, with survival rates of less than 20% ( Greenlee RT, Hill-Harmon 
MB, Murray T, et al., 2001. CA Cancer J Clin .2001;51:15-36). Currently, the only well-accepted 

10 serological marker is CA125, a large glycoprotein of unknown function (Meyer T, Rustin GJ., Br J Cancer 
.2000;82:1535-1538). However, CA125 has limitations as a diagnostic, prognostic and screening tool 
(Holschneider CH, Berek JS, Semin Surg Oncol .2000;19:3-10). Consequently, there is a need to enhance 
the overall diagnostic/prognostic capability of CA125. 

Kallikreins are a subgroup of secreted serine proteases, encoded by highly conserved and tightly 

15 clustered multigene families in humans, rats and mice. The human kallikrein gene family resides on 
chromosome 19ql3.4 and is comprised of 15 members, whose genes are designated as KLKl to KLK15 and 
the corresponding proteins as hKl to hK15 ( Yousef GM, Diamandis EP., Endocr Rev .2001;22:184-2041; 
Yousef GM, Chang A, Scorilas A, et al., Biochem Biophys Res Common. 2000;276:125-133; Diamandis EP, 
Yousef GM, Clements J, et al. Clin Chem .2000;46:1855-1858). Kallikreins are expressed in a wide variety 

20 of tissues and are found in many biological fluids (e.g. cerebrospinal fluid, serum, seminal plasma, milk, 
etc.) where they are predicted to process specific substrates. Kallikreins may participate in cascade reactions 
similar to those involved in digestion, fibrinolysis, coagulation, wound healing and apoptosis (( Yousef GM, 
Diamandis EP., Endocr Rev .2001;22:184-2041). Many kallikreins have been found to be differentially 
expressed in endocrine-related malignancies (Diamandis EP, Yousef GM, Expert Rev. Mol Diagn 

25 .2001;1:182-190), including prostate ( Barry MJ. Clinical practice, N Engl J Med .2001;344:1373-1377; 
Rittenhouse HG, Finlay JA, Mikolajczyk SD, et al., Oit Rev Clin Lab Sci .1998;35:275-368; and Yousef 
GM, Scorilas A, Jung K, et al., J Biol Chem .2001;276:53-61), ovarian ( Kim H, Scorilas A, Katsaros D, et 
al., Br J Cancer, 2001;84:643-650; Anisowicz A, Sotiropoulou G, Stenman, et al., Mol Med .1996;2:624- 
636; Tanimoto H, Underwood LJ, Shigemasa K, et al.,. Cancer .1999;86:2074-2082; Magklara A, Scorilas 

30 A, Katsaros D, et al., Clin Cancer Res .2001;7:806-81 1; Yousef GM, Kyriakopoulou LG, Scorilas A, et al., 
Cancer Res .2001;61:7811-7818; Luo L, Bunting P, Scorilas A, Diamandis EP., Clin Chim Acta 
.2001;306:111-118), breast ( Yousef GM, Magklara A, Chang A, et al., Cancer Res .2001;61:3425- 
343l;Yousef GM, Chang A, Diamandis EP; J Biol Chem .2000;275:11891-11898; and Yousef GM, 
Magklara A, Diamandis EP, Genomics .2000;69:331-341), and testicular cancer ( Luo LY, Rajpert-De Meyts 

35 ER, Jung K, et al.,2001; 85:220-224). In addition, many kallikrein genes examined thus far are under steroid 
hormone regulation, implicating a role for kallikreins in endrocrine-related tissues (Yousef GM, Diamandis 
BP., Endocr Rev., 2001;22:184-204). Furthermore, hK6, hK10 and hKll have been recently identified as 
novel serological ovarian cancer biomarkers ( Luo L, Bunting P, Scorilas A, Diamandis EP., Clin Chim Acta 
.2001;306:111-118 Diamandis EP, Yousef GM, Soosaipillai AR, Bunting P., Clin Biochem. 2000;33:579- 
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583, and Diamandis EP, Okui A, Mitsui S, et al., Cancer Res .2002;62:295-300). 
SUMMARY OF THE INVENTION 

The present invention seeks to overcome the drawbacks inherent in the prior art and seeks to 
provide sensitive and accurate multimarker methods for the detection of ovarian cancer. A plurality of 
5 kallikrein polypeptides and polynucleotides encoding the polypeptides, optionally in combination with 
CA125 and polynucleotides encoding CA125 can have particular application in the detection of ovarian 
cancer. A plurality of kallikrein markers (i.e. two or more of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 
8, kallikrein 10, and kallikrein 11) and polynucleotides encoding the polypeptides, optionally in combination 
with CA125 and polynucleotides encoding CA125, constitute biomarkers for the diagnosis, monitoring, 
10 progression, treatment, and prognosis of ovarian cancer, and they may be used as biomarkers before surgery 
or after relapse. 

In accordance with the methods of the invention, the presence of levels of markers in a sample can 
be assessed, for example by detecting the presence in the sample of (a) polypeptides or polypeptide 
fragments corresponding to the markers; (b) metabolites which are produced directly or indirectly by 
15 polypeptides corresponding to the markers; (c) transcribed nucleic acids or fragments thereof having at least 
a portion with which the markers are substantially identical; and/or (c) transcribed nucleic acids or fragments 
thereof, wherein the nucleic acids hybridize with the markers. 

In an aspect of the invention, a method is provided for detecting ovarian cancer in a patient 
comprising detecting a plurality of kallikrein polypeptides, optionally in combination with CA125, in a 
20 sample from the patient wherein the method provides substantially increased sensitivity compared to 
methods using CA125 alone. In an embodiment, sensitivity is increased by at least 0.5%, 1%, 2%, 3%, 4%, 
5%, 10%, 15%, 20%, 25%, 30%, and 35% compared to using CA125 alone. 

In an embodiment, the invention provides a method for detecting a plurality of kallikrein markers, 
and optionally CA125, associated with ovarian cancer in a patient comprising: 
25 (a) obtaining a sample from a patient; 

(b) detecting or identifying in the sample kallikrein markers, optionally in combination with 
CA125, wherein the kallikrein markers comprise or are selected from the group consisting 
of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 1 1 ; and 

(c) comparing the detected amounts with amounts detected for a standard. 

30 The term "detecf ' or "detecting" includes assaying, assessing, imaging or otherwise establishing the 

presence or absence of the target kallikrein and CA125 polypeptides or polynucleotides encoding the 
polypeptides, subunits thereof, or combinations of reagent bound targets, and the like, or assaying for, 
imaging, ascertaining, establishing, or otherwise determining one or more factual characteristics of ovarian 
cancer, metastasis, stage, or similar conditions. The term encompasses diagnostic, prognostic, and 

35 monitoring applications. The kallikrein polypeptides and CA125 can be detected individually, sequentially, 
or simultaneously. 

According to a method involving kallikrein markers optionally in combination with CA125, the 
levels in the sample of the kallikrein markers (2, 3, 4, 5, or 6) and optionally CA125, wherein the markers 
comprise or are selected from kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 
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11, are compared with the normal levels of the kallikrein markers, and optionally CA125, in samples of the 
same type obtained from controls (e.g. samples from individuals not afflicted with ovarian cancer). 
Significantly different levels in the sample of the kallkrein markers (and optionally CA125) relative to the 
normal levels in a control is indicative of ovarian cancer. 
5 In an embodiment, the invention provides a method for diagnosing and monitoring ovarian 

carcinoma in a subject comprising detecting in a sample from the subject kallikrein markers, and optionally 
CA125, wherein the kallikrein markers comprise or arc selected from the group consisting of kallikrein 5, 
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11. The kallikrein markers and CA125 can 
be detected using antibodies that bind to the kallikrein markers and CA125 or parts thereof. 
10 Thus, the invention provides a method of assessing whether a patient is afflicted with or has a pre- 

disposition for ovarian cancer, the method comprising comparing: 

(a) levels of kallikrein markers, and optionally CA125, in a sample from the patient, wherein 
the kallikrein markers comprise kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, 
kallikrein 10, and kallikrein 11; and 
15 (b) normal levels of kallikrein markers, and optionally CA125, in samples of the same type 

obtained from control patients not afflicted with ovarian cancer, wherein significantly 
different levels of the kallikrein markers and optionally CA125, relative to the 
corresponding normal levels of the kallikrein markers, and optionally CA125, is an 
indication that the patient is afflicted with ovarian cancer. 
20 In an embodiment of a method of assessing whether a patient is afflicted with ovarian cancer (e.g. 

screening, detection of a recurrence, reflex testing), the method comprises comparing: 

(a) levels of kallikrein markers, and optionally CA125, in a patient sample, wherein the 
kallikrein markers comprise or are selected from the group consisting of kallikrein 5, 
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and 
25 (b) normal levels of the kallikrein markers, and optionally CA125, in a control non-ovarian 

cancer sample. 

A significant difference between the levels of the kallikrein markers, and optionally CA125, in the 
patient sample and the normal levels is an indication that the patient is afflicted with ovarian cancer. 

The invention further relates to a method of assessing the efficacy of a therapy for inhibiting 
30 ovarian cancer in a patient This method comprises comparing: 

(a) levels of kallikrein markers, and optionally CA125, in a first sample obtained from the 
patient prior to providing at least a portion of the therapy to the patient, wherein the 
kallikrein markers comprise or are selected from the group consisting of kallikrein 5, 
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11 ; and 
35 (b) levels of the kallikrein markers, and optionally CA125, in a second sample obtained from 

the patient following therapy. 
A significant difference between the levels of the kallikrein markers, and optionally CA125, in the 
second sample, relative to the first sample, is an indication that the therapy is efficacious for inhibiting 
ovarian cancer. 
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The "therapy" may be any therapy for treating ovarian cancer including but not limited to 
chemotherapy, immunotherapy, gene therapy, radiation therapy, and surgical removal of tissue. Therefore, 
the method can be used to evaluate a patient before, during, and after therapy, for example, to evaluate the 
reduction in tumor burden. 

5 In an aspect, the invention provides a method for monitoring the progression of ovarian cancer in a 

patient, the method comprising: 

(a) detecting in a patient sample at a first time point, kallikrein markers, and optionally 
CA125, wherein the kallikrein markers comprise or are selected from the group consisting 
of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 1 0, and kallikrein 1 1 ; and 

1 0 (b) repeating step (a) at a subsequent point in time; and 

(c) comparing the levels detected in (a) and (b), and therefrom monitoring the progression of 

ovarian cancer in the patient 
In another aspect, the invention provides a method for assessing the aggressiveness or indolence of 
ovarian cancer (e.g. staging), the method comprising comparing: 
15 (a) levels of kallikrein markers, and optionally CA125, in a patient sample, wherein the 

kallikrein markers comprise or are selected from the group consisting of kallikrein 5, 
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 1 1; and 

(b) normal levels of the kallikrein markers, and optionally CA125 in a control sample. 

A significant difference between the levels in the sample and the normal levels is an indication that 
20 the cancer is aggressive or indolent. 

The invention provides a method for determining whether ovarian cancer has metastasized or is 
likely to metastasize in the future, the method comprising comparing: 

(a) levels of kallikrein markers, and optionally CA125, in a patient sample, wherein the 
kallikrein markers comprise or are selected from the group consisting of kallikrein 5, 

25 kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and 

(b) normal levels (or non-metastatic levels) of the kallikrein markers, and optionally CA125, 
in a control sample. 

A significant difference between the levels in the patient sample and the normal levels is an 
indication that the cancer has metastasized or is likely to metastasize in the future. 
30 The invention also provides a method for assessing the potential efficacy of a test agent for 

inhibiting ovarian cancer in a patient, and a method of selecting an agent for inhibiting ovarian cancer in a 
patient. 

The invention further provides a method of inhibiting ovarian cancer in a patient comprising: 
(a) obtaining a sample comprising cancer cells from the patient; 
35 (b) separately maintaining aliquots of the sample in the presence of a plurality of test agents; 

(c) comparing levels of kallikrein markers, and optionally CA125, in each of the aliquots, 
wherein the kallikrein markers comprise or are selected from the group consisting of 
kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 1 1; 

(d) administering to the patient at least one of the test agents which alters the levels of the 
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kallikrein markers, and optionally CA125, in the aliquot containing that test agent, relative 
to other test agents. 

The invention also contemplates a method of assessing the ovarian carcinogenic potential of a test 
compound comprising: 

5 (a) maintaining separate aliquots of ovarian cells in the presence and absence of the test 

compound; and 

(b) comparing levels of kallikrein markers, and optionally CA125, in each of the aliquots, 
wherein the markers comprise or are selected from the group consisting of kallikrein 5, 
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11. 
10 A significant difference between the levels of the kallikrein markers, and optionally CA125, in the 

aliquot maintained in the presence of (or exposed to) the test compound relative to the aliquot maintained in 
the absence of the test compound, indicates that the test compound possesses ovarian carcinogenic potential. 

In preferred embodiments of the methods of the invention, the kallikrein markers comprise a 
plurality of kallikrein markers, for example, at least three, four, five, or six of the markers. In particular, a 
15 plurality of kallikrein markers may be selected from the group consisting of kallikrein 5, kallikrein 7, 
kallikrein 8, and kallikrein 10, from the group consisting of kallikrein 7, kallikrein 8, kallikrein 10, and 
kallikrein 1 1, or from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10 
and kallikrein 11. 

Other methods of the invention employ one or more polynucleotides capable of hybridizing to 
20 polynucleotides encoding kallikrein markers, and optionally CA125. Methods for detecting polynucleotides 
encoding a kallikrein markers, and optionally CA125, can be used to monitor ovarian cancer by detecting the 
nucleic acids. 

Thus, the present invention relates to a method for diagnosing and monitoring ovarian cancer in a 
sample from a subject comprising isolating nucleic acids, preferably mRNA, from the sample; and detecting 
25 polynucleotides encoding kallikrein markers, and optionally CA125, in the sample. The presence of different 
levels of polynucleotides encoding kallikrein markers, and optionally CA125, in the sample compared to a 
standard or control is indicative of disease, disease stage, and/or prognosis, e.g. longer progression-free and 
overall survival. 

In an embodiment, the invention provides methods for determining the presence or absence of 
30 ovarian cancer in a subject comprising (a) contacting a sample obtained from the subject with 
oligonucleotides that hybridize to polynucleotides encoding kallikrein markers, and optionally CA125; and 
(b) detecting in the sample levels of nucleic acids that hybridize to the polynucleotides relative to a 
predetermined cut-off value, and therefrom determining the presence or absence of ovarian cancer in the 
subject Within certain embodiments, mRNA is detected via polymerase chain reaction using, for example 
35 oligonucleotide primers that hybridize to polynucleotides encoding kallikrein markers, and optionally 
CA125, or complements of such polynucleotides. Within other embodiments, the amount of mRNA is 
detected using a hybridization technique, employing oligonucleotide probes that hybridize to polynucleotides 
encoding kallikrein markers, and optionally CA125, or complements of such polynucleotides. 

When using mRNA detection, the method may be carried out by combining isolated mRNA with 
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reagents to convert to cDNA according to standard methods; treating the converted cDNA with amplification 
reaction reagents (such as cDNA PCR reaction reagents) in a container along with an appropriate mixture of 
nucleic acid primers; reacting the contents of the container to produce amplification products; and analyzing 
the amplification products to detect the presence of polynucleotides encoding kallikrein markers, and 
5 optionally CA125, in the sample. For mRNA the analyzing step may be accomplished using Northern Blot 
analysis to detect the presence of polynucleotides encoding kallikrein markers, and optionally CA125. The 
analysis step may be further accomplished by quantitatively detecting the presence of polynucleotides 
encoding kallikrein markers, and optionally CA125, in the amplification product, and comparing the quantity 
of markers detected against a panel of expected values for the known presence or absence of the kallikrein 
10 markers in normal and malignant tissue derived using similar primers. 

In embodiments of the methods of the invention, a plurality (eg. three, four, five or six) 
polynucleotides encoding kallikrein polypeptides are employed. In particular, a plurality of polynucleotides 
encoding kallikrein markers may be selected from the group consisting of polynucleotides encoding (i) 
kallikrein 5, kallikrein 7, kallikrein 8, and kallikrein 10; (ii) polynucleotides encoding kallikrein 7, kallikrein 
15 8, kallikrein 10, and kallikrein 11; and (iii) polynucleotides encloding kallikrein 5, kallikrein 6, kallikrein7, 
kallikrein 8, kallikrein 10 and kallikrein 1 1. 

The invention also provides a diagnostic composition comprising a plurality of kallikrein 
polypeptides and optionally CA125 polypeptide, or polynucleotides encoding the polypeptides, or agents that 
bind to the polypeptides or polynucleotides. 
20 In an embodiment, the composition comprises probes that specifically hybridize to polynucleotides 

encoding kallikrein markers, and optionally CA125, or fragments thereof. In another embodiment a 
composition is provided comprising specific primer pairs capable of amplifying polynucleotides encoding 
kallikrein markers, and optionally CA125, using polymerase chain reaction methodologies. In a still further 
embodiment, the composition comprises agents that bind to kallikrein markers, and optionally CA125, (e.g. 
25 antibodies) or fragments thereof. Probes, primers, and agents can be labeled with detectable substances. 

In an aspect the invention provides an in vivo method comprising administering to a subject agents 
that have been constructed to target kallikrein markers, and optionally CA125. 

The invention therefore contemplates an in vivo method comprising administering to a mammal 
imaging agents that carry labels for imaging and that bind to kallikrein markers, and optionally CA125, and 
30 then imaging the mammal. 

Still further the invention relates to therapeutic applications for ovarian cancer employing kallikrein 
markers, and optionally CA125, nucleic acids encoding the polypeptides, and/or agents identified using 
methods of the invention. 

The invention also includes kits for carrying out methods of the invention. In an embodiment, the 
35 kit is for assessing whether a patient is afflicted with ovarian cancer and it comprises reagents for assessing 
kallikrein markers, and optionally CA125, wherein the kallikrein markers comprise or are selected from the 
group consising of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 1 1. 

In another aspect the invention relates to a kit for assessing the suitability of each of a plurality of 
test compounds for inhibiting ovarian cancer in a patient The kit comprises reagents for assessing kallikrein 
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markers, and optionally CA125, wherein the markers comprise or are selected from the group consisting of 
kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kaliikrein 11. The kit may also 
comprise a plurality of test agents or compounds. 

The invention contemplates a kit for assessing the presence of ovarian cancer cells, wherein the kit 
5 comprises antibodies specific for selected kallikrein markers, and optionally CA125, wherein the markers 
comprise or are selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, 
kallikrein 10, and kallikrein 11. 

Additionally the invention provides a kit for assessing the ovarian carcinogenic potential of a test 
compound. The kit comprises ovarian cells and reagents for assessing kallikrein markers, and optionally 
10 CA125, wherein the markers comprise or are selected from the group consisting of kallikrein 5, kallikrein 6, 
kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11. 

In an aspect the invention provides a method of treating a patient afflicted with ovarian cancer 
comprising providing to cells of a patient antisense oligonucleotides complementary to polynucleotides 
encoding kallikrein markers, and optionally CA125, which are overexpressed in ovarian cancer. In an 
15 alternative method, expression of genes corresponding to kallikrein markers, and optionally CA125, which 
are underexpressed in ovarian cancer are increased. 

The invention relates to a method of inhibiting ovarian cancer in a patient at risk for developing 
ovarian cancer comprising inhibiting or increasing expression (or overexpression) of genes encoding 
kallikrein markers and optionally CA125, wherein the markers comprise or are selected from the group 
20 consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11, that are 
either overexpressed or underexpressed, in ovarian cancer. 

Other objects, features and advantages of the present invention will become apparent from the 
following detailed description- It should be understood, however, that the detailed description and the 
specific examples while indicating preferred embodiments of the invention are given by way of illustration 
25 only, since various changes and modifications within the spirit and scope of the invention will become 
apparent to those skilled in the art from this detailed description. 
DESCRIPTION OF THE DRAWINGS 

The invention will now be described in relation to the drawings in which 

Figure 1 is a graph showing hk5 concentration in serum from non-cancer and cancer patients. 
30 Figure 2 is a graph showing hk6 concentration in serum from non-cancer and cancer patients. 

Figure 3 is a graph showing hk7 concentration in serum from non-cancer and cancer patients. 

Figure 4 is a graph showing hk8 concentration in serum from non-cancer and cancer patients. 

Figure 5 is a graph showing hklO concentration in serum from non-cancer and cancer patients. 

Figure 6 is a graph showing hkl 1 concentration in serum from non-cancer and cancer patients. 
35 Figure 7 is a graph showing CA125 concentration in serum from non-cancer and cancer patients. 

Figure 8 is a ROC curve illustrating the added value of using kallikreins and CA 125 together in a 
multivariate function, 

DETAILED DESCRIPTION OF THE INVENTION 

The invention relates to newly discovered correlations between expression of certain markers and 
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ovarian cancer. The combinations of markers described herein may provide sensitive methods for detecting 
ovarian cancer. The levels of expression of a combination of markers described herein may correlate with the 
presence of ovarian cancer or a pre-malignant condition in a patient Methods are provided for detecting the 
presence of ovarian cancer in a sample, the absence of ovarian cancer in a sample, the stage of an ovarian 
5 cancer, the grade of an ovarian cancer, the benign or malignant nature of an ovarian cancer, the metastatic 
potential of an ovarian cancer, assessing the histological type of neoplasm associated with the ovarian 
cancer, the indolence or aggressiveness of the cancer, and other characteristics of ovarian cancer that are 
relevant to prevention, diagnosis, characterization, and therapy of ovarian cancer in a patient Methods are 
also provided for assessing the efficacy of one or more test agents for inhibiting ovarian cancer, assessing the 
10 efficacy of a therapy for ovarian cancer, monitoring the progression of ovarian cancer, selecting an agent or 
therapy for inhibiting ovarian cancer, treating a patient afflicted with ovarian cancer, inhibiting ovarian 
cancer in a patient, and assessing the carcinogenic potential of a test compound. 
Glossary 

The terms "sample", "biological sample", and the like, mean a material known or suspected of 
15 expressing or containing a plurality of kallikrien markers or polypeptides (2, 3, 4, 5, or 6 polypeptides), and 
optionally CA125 polypeptide, or polynucleotides encoding the polypeptides. The test sample can be used 
directly as obtained from the source or following a pretreatment to modify the character of the sample. The 
sample can be derived from any biological source, such as tissues, extracts, or cell cultures, including cells 
(e.g. tumor cells), cell lysates, and physiological fluids, such as, for example, whole blood, plasma, serum, 
20 saliva, ocular lens fluid, cerebral spinal fluid, sweat, urine, milk, ascites fluid, synovial fluid, peritoneal fluid 
and the like. The sample can be obtained from animals, preferably mammals, most preferably humans. The 
sample can be treated prior to use, such as preparing plasma from blood, diluting viscous fluids, and the like. 
Methods of treatment can involve filtration, distillation, extraction, concentration, inactivation of interfering 
components, the addition of reagents, and the like. Nucleic acids and polypeptides may be isolated from the 
25 samples and utilized in the methods of the invention. In a preferred embodiment, the sample is a serum 
sample. 

The term "subject" or "patient" refers to a warm-blooded animal such as a mammal, which is 
suspected of having ovarian cancer, or a condition, disease, or syndrome associated with ovarian cancer. 
Preferably, "subject" refers to a human. 

30 "CA125", "CA125 polypeptide", or "carbohydrate antigen 125" refers to a high-molecular weight 

mucin, which can be defined by its ability to bind to monoclonal antibody OC125. The CA125 protein core 
comprises a short cytoplasmic core tail, a transmembrane domain, and a large and heavily glycosylated 
extracellular domain dominated by a repeat domain of 156 amino acids rich in serine, threonine, and proline 
(Yin BW and Lloyd KO, J Biol Chem. 2001, 276:27371-27375; O'Brian TJ et al, Tumor Biol, 2001 22:348- 

35 366; and Hovig B. et al, Tumor Biol. 2001, 22:345-347). The sequence of CA125 is shown in GenBank 
Accession No. NPJ)78966, AAL65133 and AF414442 (SEQ ID NO. 1). The term includes the native- 
sequence polypeptides, isoforms, precursors and chimeric polypeptides. The term also includes the native 
sequence polypeptide, including polypeptide variants and polypeptides with substantial sequence identity 
(e.g. at least about 45%, preferably 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 
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99% sequence identity) to the sequence of GenBank Accession No.NP_078966 (SBQ ID NO. 1), and that 
preferably retain the immunogenic activity of the corresponding native sequence polypeptide. 

"Kallikrein polypeptides" or "kallikrein markers" comprise kallilkrein 5, kallikrein 6, kallikrein 7, 
kallikrein 8, kallikrein 10, and kallikrein 11. The term includes the native-sequence polypeptides, isoforms, 
5 precursors and chimeric polypeptides. The amino acid sequences for native kallikrein polypeptides employed 
in the present invention include the sequences found in GenBank for each polypeptide as shown in Table 1, 
and in SEQ ID NO: 3 (kallilkrein 5), NO.6 (kallikrein 6), NO. 10 (kallikrein 7), NO. 13 (kallikrein 8), NO. 
16 (kallikrein 10), and NOs. 19 and 20 (kallikrein il), or a portion thereof. Other useful polypeptides are 
substantially identical to these sequences (e.g. at least about 45%, preferably 50%, 55%, 60%, 65%, 70%, 
10 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity), and preferably retain the immunogenic 
activity of the corresponding native-sequence kallikrein polypeptide. 

A "native-sequence polypeptide" comprises a polypeptide having the same amino acid sequence of 
a polypeptide derived from nature. Such native-sequence polypeptides can be isolated from nature or can be 
produced by recombinant or synthetic means. 
15 The term "native-sequence polypeptide" specifically encompasses naturally occurring truncated or 

secreted forms of a polypeptide, polypeptide variants including naturally occurring variant forms (e.g., 
alternatively spliced forms or splice variants), and naturally occurring allelic variants. 

The term "polypeptide variant" means a polypeptide having at least about 70-80%, preferably at 
least about 85%, more preferably at least about 90%, most preferably at least about 95% amino acid 
20 sequence identity with a native-sequence polypeptide, in particular having at least 70-80%, 85%, 90%, 95% 
amino acid sequence identity to the sequences identified in the GenBank Accession Nos. in Table 1 and 
Accession No. NP_078966, AF414442 and AAL65133 and shown in SEQ ID NOS: 1, 2, 3, 6, 10, 13, 16, 19 
and 20. Such variants include, for instance, polypeptides wherein one or more amino acid residues are 
added to, or deleted from, the N- or C-terminus of the full-length or mature sequences of SEQ ID NOS: 1, 2, 
25 3, 6, 10, 13, 16, 19 and 20, including variants from other species, but excludes a native-sequence 
polypeptide. 

An allelic variant may also be created by introducing substitutions, additions, or deletions into a 
nucleic acid encoding a native polypeptide sequence such that one or more amino acid substitutions, 
additions, or deletions are introduced into the encoded protein. Mutations may be introduced by standard 

30 methods, such as site-directed mutagenesis and PCR-mediated mutagenesis. In an embodiment, conservative 
substitutions are made at one or more predicted non-essential amino acid residues. A "conservative amino 
acid substitution" is one in which an animo acid residue is replaced with an amino acid residue with a similar 
side chain. Amino acids with similar side chains are known in the art and include amino acids with basic side 
chains (e.g. Lys, Arg, His), acidic side chains (e.g. Asp, Glu), uncharged polar side chains (e.g. Gly, Asp, 

35 Glu, Ser, Thr, Tyr and Cys), nonpolar side chains (e.g. Ala, Val, Leu, Iso, Pro, Trp), beta-branched side 
chains (e.g. Thr, Val, Iso), and aromatic side chains (e.g. Tyr, Phe, Trp, His). Mutations can also be 
introduced randomly along part or all of the native sequence, for example, by saturation mutagenesis. 
Following mutagenesis the variant polypeptide can be recombinantly expressed and the activity of the 
polypeptide may be determined. 
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Polypeptide variants include polypeptides comprising amino acid sequences sufficiently identical to 
or derived from the amino acid sequence of a native polypeptide which include fewer amino acids than the 
full length polypeptides. A portion of a polypeptide can be a polypeptide which is for example, 10, 15, 20, 
25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100 or more amino acids in length. Portions in which regions of a 
5 polypeptide are deleted can be prepared by recombinant techniques and can be evaluated for one or more 
functional activities such as the ability to form antibodies specific for a polypeptide. 

A naturally occurring allelic variant may contain conservative amino acid substitutions from the 
native polypeptide sequence or it may contain a substitution of an amino acid from a corresponding position 
in a CA125 or kallikrein polypeptide homolog, for example, the murine CA125 or kallikrein polypeptide. 

10 Percent identity of two amino acid sequences, or of two nucleic acid sequences identified herein is 

defined as the percentage of amino acid residues or nucleotides in a candidate sequence that are identical 
with the amino acid residues in a CA125 or kallikrein polypeptide or nucleic acid sequence, after aligning 
the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not 
considering any conservative substitutions as part of the sequence identity. Alignment for purposes of 

15 determining percent amino acid or nucleic acid sequence identity can be achieved in various conventional 
ways, for instance, using publicly available computer software including the GCG program package 
(Devereux J. et al., Nucleic Acids Research 12(1): 387, 1984); BLASTP, BLASTN, and FASTA (Atschul, 
S.F. et al. J. Molec. Biol. 215: 403-410, 1990). The BLAST X program is publicly available from NCBI and 
other sources (BLAST Manual, Altschul, S. et al. NCBI NLM NIH Bethesda, Md. 20894; Altschul, S. et al. 

20 J. Mol. Biol. 215: 403-410, 1990). Skilled artisans can determine appropriate parameters for measuring 
alignment, including any algorithms needed to achieve maximal alignment over the full length of the 
sequences being compared. Methods to determine identity and similarity are codified in publicly available 
computer programs. 

CA125 and kallikrien polypeptides include chimeric or fusion proteins. A "chimeric protein" or 
25 "fusion protein" comprises all or part (preferably biologically active) of a CA125 or kallikrein polypeptide 
operably linked to a heterologous polypeptide (i.e., a polypeptide other than the same CA125 or kallikrein 
polypeptide). Within the fusion protein, the term "operably linked" is intended to indicate that the CA125 or 
kallikrein polypeptide and the heterologous polypeptide are fused in-frame to each other. The heterologous 
polypeptide can be fused to the N-terminus or C-terminus of the CA125 or kallikrein polypeptide. A useful 
30 fusion protein is a GST fusion protein in which a kallikrein polypeptide is fused to the C-terminus of GST 
sequences. Another example of a fusion protein is an immunoglobulin fusion protein in which all or part of a 
CA125 or kallikrein polypeptide is fused to sequences derived from a member of the immunoglobulin 
protein family. Chimeric and fusion proteins can be produced by standard recombinant DNA techniques. 

CA125 and kallikrein polypeptides may be isolated from a variety of sources, such as from human 
35 tissue types or from another source, or prepared by recombinant or synthetic methods, or by any combination 
of these and similar techniques. 

"CA125 polynucleotides" or "polynucleotides encoding CA125" include nucleic acids that encode a 
native-sequence polypeptide, a polypeptide variant including a portion of a CA125 polypeptide, an isoform, 
precursor, and chimeric polypeptide. A nucleic acid sequence encoding native CA125 employed in the 
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present invention includes the nucleic acid sequence in GenBank Accession No. AF414442 and SEQ ID NO. 
2, or a fragment thereof. 

"Kallikrein polynucleotides" or "polynucleotides encoding kallikrein markers/polypeptides" refers 
to kallilkrein 5 nucleic acids (KLK5), kallikrein 6 nucleic acids (KLK6), kallikrein 7 nucleic acids (KLK7), 
5 kallikrein 8 nucleic acids (KLK8), kallikrein 10 nucleic acids (KLK10), and/or kallikrein 11 nucleic acids 
(FCLK11). The term includes nucleic acids that encode a native-sequence polypeptide, a polypeptide variant 
including a portion of a kallikrein polypeptide, an isoform, precursor, and chimeric polypeptide. 

The polynucleotide sequences encoding native kallikrein polypeptides employed in the present 
invention include the nucleic acid sequences of the GenBank Accession Nos. identified in Table 1, and in 
1 0 SEQ ID NOs: 4 and 5 (KLK5), NOs. 7, 8, and 9 (KLK6), NOs. 1 1 and 12 (KLK 7), NOs. 14 and 15 (KLK8), 
NOs. 17 and 18 (KLK10), and NOs. 21 and 22 (KLK1 1), or a fragment thereof. 

Polynucleotides encoding kallikrien polypeptides and CA125 include nucleic acid sequences 
complementary to these polynucleotides, and polynucleotides that are substantially identical to these 
sequences (e.g. at least about 45%, preferably 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%,90%, 95%, 97%, 
15 98%, or 99% sequence identity). 

CA125 and kallikrein polynucleotides also include sequences which differ from a nucleic acid 
sequence of GenBank Accession Nos. identified in Table 1 and SEQ ID NOS: 2, 4, 5, 7, 8, 9, 1 1, 12, 14, 15, 
17, 18, 21, and 22, due to degeneracy in the genetic code. As one example, DNA sequence polymorphisms 
within the nucleotide sequence of a CA125 or kallikrein polypeptide may result in silent mutations which do 
20 not affect the amino acid sequence. Variations in one or more nucleotides may exist among individuals 
within a population due to natural allelic variation. DNA sequence polymorphisms may also occur which 
lead to changes in the amino acid sequence of CA125 or a kallikrein polypeptide. 

CA125 and kallikrein polynucleotides also include nucleic acids that hybridize under stringent 
conditions, preferably high stringency conditions to a nucleic acid sequence of the GenBank Accession Nos. 
25 identified in Table 1 and SEQ ID NOS: 2, 4, 5, 7, 8, 9, 11, 12, 14, 15, 17, 18, 21, and 22. Appropriate 
stringency conditions which promote DNA hybridization are known to those skilled in the art, or can be 
found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For 
example, 6.0 x sodium chloride/sodium citrate (SSC) at about 45°C, followed by a wash of 2.0 x SSC at 
50°C may be employed. The stringency may be selected based on the conditions used in the wash step. By 
30 way of example, the salt concentration in the wash step can be selected from a high stringency of about 0.2 x 
SSC at 50°C. In addition, the temperature in the wash step can be at high stringency conditions, at about 
65°C. 

CA125 and kallikrein polynucleotides also include truncated nucleic acids or fragments and variant 
forms of the polynucleotides that arise by alternative splicing of an mRNA corresponding to a DNA. 
35 The CA125 and kallikrien polynucleotides are intended to include DNA and RNA (e.g. mRNA) and 

can be either double stranded or single stranded. A polynucleotide may, but need not, include additional 
coding or non-coding sequences, or it may, but need not, be linked to other molecules and/or carrier or 
support materials. The polynucleotides for use in the methods of the invention may be of any length suitable 
for a particular method 
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A purality of kallikrein polypeptides or kallikrein polynucleotides are generally detected in the 
present invention. "Plurality" refers to 2, 3, 4, 5, or 6 kallikrein polypeptides or polynucleotides, in particular 
3, 4, 5, or 6, preferably 4, 5, or 6, more preferably 5 or 6 kallikrein polypeptides or polynucleotides. 

In an embodiment a plurality of kallikrein polypeptides is selected from the group consisting of 
5 kallikrein 5, kallikrein 7, and kallikrein 8; kallikrein 5, kallikrein 8, and kallikrein 10; kallikrein 7, kallikrein 
8, and kallikrein 10; kallikrein 5, kallikrein 7, kallikrein 8, and kallikrein 10; kallikrein 7, kallikrein 8, 
kallikrein 10, and kallikrein 11; or kallikrein 5, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11. In 
another embodiment, a plurality of kallikrein polypeptides is selected from the group consisting of kallilkrein 
5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10 and kallikrein 11. 

10 In an embodiment, a pluraity of kallikrein polynucleotides is selected from the group consisting of 

KLK5, KLK7, and KLK8; KLK5, KLK8 and KLK10; KLK7, KLK8 and KLFC10; KLK5, KLK7, KLK8, and 
KLK10; KLK7, KLK8, KLK10 and KLK.11, or KLK5, KLK7, KLK8, KLK10 and KLK11. In another 
embodiment, a plurality of kallikrein polynucleotides is selected from the group consisting of KLK5, KLK6, 
KLK7, KLK8, KLK10, and KLK1 1. 

15 General Methods 

A variety of methods can be employed for the diagnostic and prognostic evaluation of ovarian 
cancer involving kallikrein polypeptides, and optionally CA125 polypeptide, and polynucleotides encoding 
the polypeptides, and the identification of subjects with a predisposition to such disorders. Such methods 
may, for example, utilize polynucleotides encoding kallikrein polypeptides, and optionally CA125, and 

20 fragments thereof, and* binding agents (e.g. antibodies aptamers) against kallikrein polypeptides, and 
optionally CA125 polypeptide, including peptide fragments. In particular, the polynucleotides and antibodies 
may be used, for example, for (1) the detection of either over- or under-expression of kallikrein 
polynucleotides, and optionally CA125, relative to a non-disorder state; and (2) the detection of either an 
over- or an under-abundance of kallikrein polypeptides, and optionally CA125, relative to a non-disorder 

25 state or the presence of modified (e.g., less than full length) kallikrein polypeptides, and optionally CA125, 
that correlate with a disorder state, or a progression toward a disorder state. 

The invention also contemplates a method for detecting ovarian cancer comprising producing a 
profile of levels of a plurality of kallikrein markers, and optionally CA125, in cells from a patient, wherein 
the markers are kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11, and 

30 comparing the profile with a reference to identify a protein profile for the test cells indicative of disease. 

The methods described herein may be used to evaluate the probability of the presence of malignant 
or pre-malignant cells, for example, in a group of cells freshly removed from a host. Such methods can be 
used to detect tumors, quantitate their growth, and help in the diagnosis and prognosis of disease. The 
methods can be used to detect the presence of cancer metastasis, as well as confirm the absence or removal 

35 of all tumor tissue following surgery, cancer chemotherapy, and/or radiation therapy. They can further be 
used to monitor cancer chemotherapy and tumor reappearance. 

The methods described herein can be adapted for diagnosing and monitoring ovarian cancer by 
detecting a plurality of kallikrein polypeptides, and optionally CA125 polypeptide, or nucleic acids encoding 
the polypeptides in biological samples from a subject These applications require that the amount of 
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polypeptides or nucleic acids quantitated in a sample from a subject being tested be compared to a 
predetermined standard. The standard may correspond to levels quantitated for another sample or an earlier 
sample from the subject, or levels quantitated for a control sample. Levels for control samples from healthy 
subjects or ovarian cancer subjects may be established by prospective and/or retrospective statistical studies. 
5 Healthy or normal subjects who have no clinically evident disease or abnormalities may be selected for 
statistical studies. Diagnosis may be made by a finding of statistically different levels of a plurality of 
kallikrein polypeptides, and optionally CA125, or nucleic acids encoding same, compared to a control 
sample or previous levels quantitated for the same subject. A "significant difference" in levels of kallikrein 
markers or polynucleotides encoding the kallikrein markers in a patient sample compared to a control or 

10 standard (e.g. normal levels or levels in other samples from a patient) may represent levels that are higher or 
lower than the standard error of the detection assay, preferably the levels are at least about 1.5, 2, 3, 4, 5, or 6 
times higher or lower, respectively, than the control or standard. The difference in levels of markers or 
polynucleotides may be a "statistically significant difference" 
Nucleic Acid Methods/Assays 

15 As noted herein an ovarian cancer may be detected based on the levels of polynucleoitdes encoding 

kallikrein polypeptides, and optionally CA125, in a sample. Techniques for detecting polynucleotides such 
as polymerase chain reaction (PCR) and hybridization assays are well known in the art. 

Nucleotide probes for use in the detection of nucleic acid sequences in samples may be constructed 
using conventional methods known in the art. Suitable probes may be based on nucleic acid sequences 

20 encoding at least 5 sequential amino acids from regions of nucleic acids encoding kallikrein polypeptides, 
and optionally CA125, preferably they comprise 15 to 40 nucleotides. A nucleotide probe may be labeled 
with a detectable substance such as a radioactive label that provides for an adequate signal and has sufficient 
half-life such as 32 P, 3 H, l4 C or the like. Other detectable substances that may be used include antigens that 
are recognized by a specific labeled antibody, fluorescent compounds, enzymes, antibodies specific for a 

25 labeled antigen, and luminescent compounds. An appropriate label may be selected having regard to the rate 
of hybridization and binding of the probe to the nucleotide to be detected and the amount of nucleotide 
available for hybridization Labeled probes may be hybridized to nucleic acids on solid supports such as 
nitrocellulose filters or nylon membranes as generally described in Sambrook et al, 1989, Molecular 
Cloning, A Laboratory Manual (2nd ed.). The nucleic acid probes may be used to detect polynucleoitides 

30 encoding kallikrein polypeptides, and optionally CA125, preferably in human cells. The nucleotide probes 
may also be useful in the diagnosis of ovarian cancer involving polynucleoitides encoding kallikrein 
polypeptides, and optionally CA125, in monitoring the progression of such disorder; or monitoring a 
therapeutic treatment. 

Probes may be used in hybridization techniques to detect nucleic acids encoding a plurality of 
35 kallikrein polypeptides, and optionally CA125. The technique generally involves contacting and incubating 
nucleic acids (e.g. recombinant DNA molecules, cloned genes) obtained from a sample from a patient or 
other cellular source with probes under conditions favorable for the specific annealing of the probes to 
complementary sequences in the nucleic acids. After incubation, the non-annealed nucleic acids are 
removed, and the presence of nucleic acids that have hybridized to the probe if any are detected 
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The detection of polynucleotides encoding kallikrein polypeptides and optionally CA125, may 
involve the amplification of specific gene sequences using an amplification method such as polymerase 
chain reaction (PCR), followed by the analysis of the amplified molecules using techniques known to those 
skilled in the art. Suitable primers can be routinely designed by one of skill in the art. 
5 By way of example, oligonucleotide primers may be employed in a PCR based assay to amplify a 

portion of nucleic acids encoding each of a plurality of kallikrein polypeptides, and optionally CA125, 
derived from a sample, wherein the oligonucleotide primers are specific for (i.e. hybridize to) 
polynucleotides encoding each of the plurality of kallikrein polypeptides, and optionally CA125. The 
amplified cDNA is then separated and detected using techniques well known in the art, such as gel 
10 electrophoresis. 

In order to maximize hybridization under assay conditions, primers and probes employed in the 
methods of the invention generally have at least about 60%, preferably at least about 75% and more 
preferably at least about 90% identity to a portion of polynucleotides encoding a plurality of kallikrein 
polypeptides, and CA125. The primers and probes may be at least 10 nucleotides, and preferably at least 20 
15 nucleotides in length. In an embodiment the primers and probes are at least about 10-40 nucleotides in 
length. 

Hybridization and amplification techniques described herein may be used to assay qualitative and 
quantitative aspects of expression of polynucleotides encoding kallikrein polypeptides, and optionally 
CA125. For example, RNA may be isolated from a cell type or tissue known to express these 
20 polynucleotides and tested utilizing the hybridization (e.g. standard Northern analyses) or PCR techniques 
referred to herein. 

The primers and probes may be used in the above-described methods in situ i.e directly on tissue 
sections (fixed and/or frozen) of patient tissue obtained from biopsies or resections. 

In an aspect of the invention, a method is provided employing reverse transcriptase-polymerase 
25 chain reaction (RT-PCR), in which PCR is applied in combination with reverse transcription. Generally, 
RNA is extracted from a sample tissue using standard techniques (for example, guanidine isothiocyanate 
extraction as described by Chomcynski and Sacchi, Anal. Biochem. 162:156-159, 1987) and is reverse 
transcribed to produce cDNA. The cDNA is used as a template for a polymerase chain reaction. The cDNA 
is hybridized to sets of primers specifically designed against each of a plurality of kallikrein polynucleotide 
30 sequences, and optionally CA125. Once the primer and template have annealed a DNA polymerase is 
employed to extend from the primer, to synthesize a copy of the template. The DNA strands are denatured, 
and the procedure is repeated many times until sufficient DNA is generated to allow visualization by 
ethidium bromide staining and agarose gel electrophoresis. 

Amplification may be performed on samples obtained from a subject with suspected ovarian cancer 
35 and an individual who is not afflicted with ovarian cancer. The reaction may be performed on several 
dilutions of cDNA spanning at least two orders of magnitude. A statistically significant difference in 
expression in several dilutions of the subject sample as compared to the same dilutions of the non-cancerous 
sample may be considered positive for the presence of ovarian cancer. 

Oligonucleotides or longer fragments derived from polynucleotides encoding each of a plurality of 
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kallikrein polypeptides and optionally CA125, may be used as targets in a microarray. The microarray can be 
used to simultaneously monitor the expression levels of large numbers of genes. The information from the 
microarray may be used to diagnose a disorder, and to develop and monitor the activities of therapeutic 
agents. 

5 The preparation, use, and analysis of microarrays are well known to a person skilled in the art (See, 

for example, Brennan, T. M. et al. (1995) U.S. Pat No. 5,474,796; Schena, et al. (1996) Proc. Natl, Acad. 
Sci. 93:10614-10619; Baldeschweiler et al. (1995), PCT Application W095/251 1 16; Shalon, D. et al. (1 995) 
PCT application WO95/35505; Heller, R. A. et al. (1997) Proc. Natl. Acad. Sci. 94:2150-2155; and Heller, 
M. J. et al. (1997) U.S. Pat No. 5,605,662.) 

10 Thus, the invention also includes an array comprising a plurality of polynucleotides encoding 

kallikrein marker(s), and optionally CA125 polynucleotides. The array can be used to assay expression of 
kallikrein polynucleotides, and optionally CA125 polynucleotides in the array. The invention allows the 
quantitation of expression of a plurality of kallikrein polynucleotides, and optionally CA125 polynucleotides. 
In an embodiment, the array can be used to monitor the time course of expression of a plurality of 

15 kallikrein polynucleotides, and optionally CA125 polynucleotides, in the array. This can occur in various 
biological contexts such as tumor progression. 

The array is also useful for ascertaining differential expression patterns of a plurality of kallikrein 
polynucleotides and optionally CA125 polynucleotides, in normal and abnormal cells. This provides a 
battery of polynucleotides that could serve as molecular targets for diagnosis or therapeutic intervention. 

20 Protein Methods 

Binding agents specific for a plurality of kallikrein markers and CA125 may be used for a variety of 
diagnostic and assay applications. There are a variety of assay formats known to the skilled artisan for using 
a binding agent to detect a target molecule in a sample. (For example, see Harlow and Lane, Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory, 1988). In general, the presence or absence of an ovarian 
25 cancer in a subject may be determined by (a) contacting a sample from the subject with binding agents for a 
plurality of kallikrein polypeptides, and optionally CA125; (b) detecting in the sample levels of polypeptides 
that bind to the binding agents; and (c) comparing the levels of polypeptides with a predetermined standard 
or cut-off value. 

"Binding agent" refers to a substance such as a polypeptide or antibody that specifically binds to a 
30 kallikrein or CA125 polypeptide. A substance "specifically binds" to a polypeptide if it reacts at a detectable 
level with the kallikrein or CA125 polypeptide, and does not react detectably with peptides containing 
unrelated sequences or sequences of different polypeptides. Binding properties may be assessed using an 
ELISA, which may be readily performed by those skilled in the art (see for example, Newton et al , Develop. 
Dynamics 197: 1-13, 1993). 

35 A binding agent may be a ribosome, with or without a peptide component, an aptamer, an RNA 

molecule, or a polypeptide. A binding agent may be a polypeptide that comprises a kallikrein polypeptide or 
CA125 polypeptide sequence, a peptide variant thereof, or a non-peptide mimetic of such a sequence. By 
way of example a kallikrein polypeptide sequence may be a peptide portion of a kallikrein polypeptide that is 
capable of modulating a function mediated by the kallikrein polypeptide. 
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An aptamer includes a DNA or RNA molecule that binds to polynucleotides and polypeptides. An 
aptamer that binds to a polypeptide (or binding domain) of a kallikrein polypeptide or a polynucleotide 
encoding a kallikrein polypeptide can be produced using conventional techniques, without undue 
experimentation. [For example, see the following publications describing in vitro selection of aptamers: Klug 
5 et al, Mol. Biol. Reports 20:97-107 (1994); Wallis et al., Chem. Biol. 2:543-552 (1995); Ellington, Curr. 
Biol. 4:427-429 (1994); Lato et al., Chem. Biol. 2:291-303 (1995); Conrad et al., Mol. Div. 1:69-78 (1995); 
and Uphoff et al., Curr. Opin. Struct Biol. 6:281-287 (1996)]. 

In certain other preferred embodiments, the binding agent is an antibody. 

In an aspect the present invention provides a diagnostic method for monitoring or diagnosing 
10 ovarian cancer in a subject by quantitating a plurality of kallikrein polypeptides, and optionally CA125, in a 
biological sample from the subject comprising reacting the sample with antibodies specific for a plurality of 
kallikrein polypeptides, and optionally CA125, which are directly or indirectly labelled with detectable 
substances, and detecting the detectable substances. 

In an aspect of the invention, a method for detecting ovarian cancer is provided comprising: 
15 (a) obtaining a sample suspected of containing a plurality of kallikrein polypeptides, and 

optionally CA125, wherein the kallikrein polypeptides comprise or are selected from the 
group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10 and 
kallikrein 11; 

(b) contacting the sample with antibodies that specifically bind to the plurality of kallikrein 
20 polypeptides, and optionally CA125, under conditions effective to bind the antibodies and 

form complexes; 

(c) measuring the amount of kallikrein polypeptides, and optionally CA125, present in the 
sample by quantitating the amount of the complexes; and 

(d) comparing the amount of kallikrein polypeptides, and optionally CA125, present in the 
25 samples with the amount of polypeptides in a control, wherein a change or significant 

difference in the amount of polypeptides in the sample compared with the amount in the 
control is indicative of ovarian cancer. 
In an embodiment, the invention contemplates a method for monitoring the progression of ovarian 
cancer in an individual, comprising: 
30 (a) contacting antibodies which bind to each of a plurality of kallikrein polypeptides, and 

optionally CA125, with a sample from the individual so as to form binary complexes 
comprising each of the antibodies and polypeptides in the sample; 

(b) determining or detecting the presence or amount of complex formation in the sample; 

(c) repeating steps (a) and (b) at a point later in time; and 

35 (d) comparing the result of step (b) with the result of step (c), wherein a difference in the 

amount of complex formation is indicative of the stage and/or progression of the ovarian 
cancer in said individual. 

The amount of complexes may also be compared to a value representative of the amount of the 
complexes from an individual not at risk of, or afflicted with, ovarian cancer at different stages. 
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Thus, antibodies specifically reactive with each of a plurality of kallikrein polypeptides, and 
CA125, or derivatives, such as enzyme conjugates or labeled derivatives, may be used to detect a plurality of 
kallikrein polypeptides, and optionally CA125, in various samples (e.g. biological materials). They may be 
used as diagnostic or prognostic reagents and they may be used to detect abnormalities in the levels of 
5 expression of a plurality of kallikrein polypeptides, and optionally CA125, or abnormalities in the structure, 
and/or temporal, tissue, cellular, or subcellular location of a plurality of kallikrein polypeptides, and 
optionally CA125. Antibodies may also be used to screen potentially therapeutic compounds in vitro to 
determine their effects on ovarian cancer involving a plurality of kallikrein polypeptides, and optionally 
CA125, and other conditions. In vitro immunoassays may also be used to assess or monitor the efficacy of 

1 0 particular therapies. 

Antibodies may be used in any known immunoassays that rely on the binding interaction between 
antigenic determinants of a plurality of kallikrein polypeptides, and optionally CA125, and the antibodies. 
Examples of such assays are radioimmunoassays, enzyme immunoassays (e.g. ELISA), 
immunofluorescence, immunoprecipitation, latex agglutination, hemagglutination, and histochemical tests. 

15 These terms are well understood by those skilled in the art. A person skilled in the art will know, or can 
readily discern, other immunoassay formats without undue experimentation. 

In particular, the antibodies may be used in immunohistochemical analyses, for example, at the 
cellular and sub-subcellular level, to detect a plurality of kallikrein polypeptides, and optionally CA125, to 
localize them to particular ovarian tumor cells and tissues, and to specific subcellular locations, and to 

20 quantitate the level of expression. 

Antibodies for use in the present invention include monoclonal or polyclonal antibodies, 
immunologically active fragments (e.g. a Fab or (Fab) 2 fragments), antibody heavy chains, humanized 
antibodies, antibody light chains, genetically engineered single chain F v molecules (Ladner et al, U.S. Pat. 
No. 4,946,778), chimeric antibodies, for example, antibodies which contain the binding specificity of murine 

25 antibodies, but in which the remaining portions are of human origin, or derivatives, such as enzyme 
conjugates or labeled derivatives. 

Antibodies including monoclonal and polyclonal antibodies, fragments and chimeras, may be 
prepared using methods known to those skilled in the art. Isolated native or recombinant kallikrein 
polypeptides or CA125 may be utilized to prepare antibodies. See, for example, Kohler et al. (1975) Nature 

30 256:495-497; Kozbor et al (1985) J. Immunol Methods 81:31-42; Cote et al. (1983) Proc Natl Acad Sci 
80:2026-2030; and Cole et al. (1984) Mol Cell Biol 62:109-120 for the preparation of monoclonal 
antibodies; Huse et al. (1989) Science 246:1275-1281 for the preparation of monoclonal Fab fragments; and, 
Pound (1998) Immunochemical Protocols, Humana Press, Totowa, N.J for the preparation of phagemid or B- 
lymphocyte immunoglobulin libraries to identify antibodies. The antibodies specific for kallikrein 

35 polypeptides or CA125 used in the methods of the invention may also be obtained from scientific or 
commercial sources. 

In an embodiment of the invention, antibodies are reactive against kallikrein polypeptides or CA125 
if they bind'with a of greater than or equal to 10* 7 M. 

Antibodies that bind to kallikrein polypeptides or CA125 may be labelled with a detectable 



WO 2004/075713 



PCT/CA2004/000281 



- 18- 

substance and localised in biological samples based upon the presence of the detectable substance. Examples 
of detectable substances include, but are not limited to, the following: radioisotopes (e.g., 3 H, l4 C, 35 S, ,25 I, 
131 1), fluorescent labels (e.g., FITC, rhodamine, lanthanide phosphors), luminescent labels such as luminol, 
enzymatic labels (e.g., horseradish peroxidase, beta-galactosidase, luciferase, alkaline phosphatase, 
5 acetylcholinesterase), biotinyl groups (which can be detected by marked avidin e.g., streptavidin containing a 
fluorescent marker or enzymatic activity that can be detected by optical or colorimetric methods), and 
predetermined polypeptide epitopes recognized by a secondary reporter (e.g., leucine zipper pair sequences, 
binding sites for secondary antibodies, metal binding domains, epitope tags). In some embodiments, labels 
are attached via spacer arms of various lengths to reduce potential steric hindrance. Antibodies may also be 
10 coupled to electron dense substances, such as ferritin or colloidal gold, which are readily visualised by 
electron microscopy. 

Indirect methods may also be employed in which the primary antigen-antibody reaction is amplified 
by the introduction of a second antibody, having specificity for the antibody reactive against a kallikrein 
polypeptide or CA125. The second antibody may be labeled with a detectable substance to detect the 
15 primary antigen-antibody reaction. By way of example, if the antibody having specificity against a kallikrein 
polypeptide is a rabbit IgG antibody, the second antibody may be goat anti-rabbit gamma-globulin labelled 
with a detectable substance as described herein. 

Methods for conjugating or labelling the antibodies discussed above may be readily accomplished 
by one of ordinary skill in the art (See for example Inman, Methods In Enzymology, Vol. 34, Affinity 
20 Techniques, Enzyme Purification: Part B, Jakoby and Wichek (eds.), Academic Press, New York, p. 30, 
1974; and Wilchek and Bayer, "The Avidin-Biotin Complex in Bioanalytical Applications,"Anal. Biochem. 
171:1-32, 1988 re methods for conjugating or labelling the antibodies with enzyme or ligand binding 
partner). 

Cytochemical techniques known in the art for localizing antigens using light and electron 
25 microscopy may be used to detect a plurality of kallikrein polypeptides, and optionally CA125. Generally, 
antibodies may be labeled with detectable substances and kallikrein polypeptides, and optionally CA125, 
may be localised in tissues and cells based upon the presence of the detectable substance. 

In the context of the methods of the invention, the sample, binding agents (e.g. antibodies) for a 
plurality of kallikrein polypeptides, and CA125 may be immobilized on a carrier or support Examples of 
30 suitable carriers or supports are agarose, cellulose, nitrocellulose, dextran, Sephadex, Sepharose, liposomes, 
carboxymethyl cellulose, polyacrylamides, polystyrene, gabbros, filter paper, magnetite, ion-exchange resin, 
plastic film, plastic tube, glass, polyamine-methyl vinyl-ether-maleic acid copolymer, amino acid copolymer, 
ethylene-maleic acid copolymer, nylon, silk, etc. The support material may have any possible configuration 
including spherical (e.g. bead), cylindrical (e.g. inside surface of a test tube or well, or the external surface of 
35 a rod), or flat (e.g. sheet, test strip). Thus, the carrier may be in the shape of, for example, a tube, test plate, 
well, beads, disc, sphere, etc. The immobilized material may be prepared by reacting the material with a 
suitable insoluble carrier using known chemical or physical methods, for example, cyanogen bromide 
coupling. Binding agents (e.g. antibodies) may be indirectly immobilized using second binding agents 
specific for the first binding agent For example, mouse antibodies specific for a kallikrein polypeptide may 
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be immobilized using sheep anti-mouse IgQ Fc fragment specific antibody coated on the carrier or support. 

Where radioactive labels are used as a detectable substance, a plurality of kallikrein polypeptides, 
and optionally CA125, may be localized by radioautography. The results of radioautography may be 
quantitated by determining the density of particles in the radioautographs by various optical methods, or by 
5 counting the grains. 

Time-resolved fluorometry may be used to detect a signal. For example, the method described in 
Christopoulos TK and Diamandis EP Anal Chem 1992:64:342-346 may be used with a conventional time- 
resolved fluorometer. 

Therefore, in accordance with an embodiment of the invention, a method is provided wherein 

10 antibodies specific for each of a plurality of kallikrein polypeptides, and optionally CA125, are labelled with 
enzymes, substrates for the enzymes are added wherein the substrates are selected so that the substrates, or a 
reaction product of the enzymes and substrates, form fluorescent complexes with lanthanide metals. 
Lanthanide metals are added and the plurality of kallikrein polypeptides, and optionally CA125, are 
quantitated in the sample by measuring fluorescence of the fluorescent complexes. Antibodies specific for 

15 CA125 and each of a plurality of kallikrein polypeptides may be directly or indirectly labelled with enzymes. 
Enzymes are selected based on the ability of a substrate of the enzyme, or a reaction product of the enzyme 
and substrate, to complex with lanthanide metals such as europium and terbium. Examples of suitable 
enzymes include alkaline phosphatase and P-galactosidase. 

Examples of enzymes and substrates for enzymes that provide such fluorescent complexes are 

20 described in U.S. Patent No. 5,312,922 to Diamandis. By way of example, when the antibody is directly or 
indirectly labelled with alkaline phosphatase the substrate employed in the method may be 4- 
methylumbelliferyl phosphate, 5-fluorosalicyl phosphate, or diflunisal phosphate. The fluorescence intensity 
of the complexes is typically measured using a time-resolved fluorometer e.g. a CyberFluor 615 
Imunoanalyzer (Nordion International, Kanata, Ontario). 

25 Antibodies specific for a plurality of kallikrein polypeptides and CA125 may also be indirectly 

labelled with enzymes. For example, an antibody may be conjugated to one partner of a ligand binding pair, 
and the enzyme may be coupled to the other partner of the ligand binding pair. Representative examples 
include avidin-biotin, and riboflavin-riboflavin binding protein. In another embodiment, antibodies specific 
for the anti-kallikrein antibodies or anti- CA125 antibodies are labeled with an enzyme. 

30 In accordance with an embodiment, the present invention provides means for determining a 

plurality of kallikrein polypeptides, and optionally CA125, in a sample, in particular a serum sample, by 
measuring a plurality of kallikrein polypeptides, and optionally CA125, by immunoassay. It will be evident 
to a skilled artisan that a variety of immunoassay methods can be used to measure a plurality of kallikrein 
polypeptides and CA125 in serum. In general, an immunoassay method may be competitive or 

35 noncompetitive. Competitive methods typically employ immobilized or immobilizable antibodies to each of 
a plurality of kallikrein polypeptides, and optionally CA125, and a labeled form of each of a plurality of 
kallikrein polypeptides, and optionally CA125. Kallikrein polypeptides and CA125 and labeled kallikrein 
polypeptides and CA125 compete for binding to anti-kallikrein antibodies and anti-CA125 antibodies. After 
separation of the resulting labeled kallikrein polypeptides and CA125 that have become bound to anti- 
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kallikrein polypeptides and anti- CA125 (bound fraction) from that which has remained unbound (unbound 
fraction), the amount of the label in either bound or unbound fraction is measured and may be correlated 
with the amount of kallikrein polypeptides, and optionally CA125, in the test sample in any conventional 
manner, e.g., by comparison to a standard curve. 
5 In an aspect, a non-competitive method is used for the determination of a plurality of kallikrein 

polypeptides, and optionally CA125, with the most common method being the "sandwich" method. In this 
assay, two types of antibodies specific for each of a plurality of kallikrein polypeptides, and optionally 
CA125 are employed. One type of antibody is directly or indirectly labeled (sometimes referred to as the 
"detection antibody") and the other is immobilized or immobilizable (sometimes referred to as the "capture 

10 antibody"). The capture and detection antibodies can be contacted simultaneously or sequentially with a test 
sample. Sequential methods can be accomplished by incubating capture antibodies with the sample, and 
adding the detection antibodies at a predetermined time thereafter (sometimes referred to as the "forward" 
method); or the detection antibodies can be incubated with the sample first and then the capture antibodies 
added (sometimes referred to as the "reverse" method). After the necessary incubation(s) have occurred, to 

15 complete the assay, the capture antibodies are separated from the liquid test mixture, and labels are measured 
in at least a portion of the separated capture antibody phase or the remainder of the liquid test mixture. 
Generally the labels are measured in the capture antibody phase since it comprises kallikrein polypeptides, 
and optionally CA125, bound by ("sandwiched" between) the capture and detection antibodies. In an 
embodiment, the label may be measured without separating the capture antibodies and liquid test mixture. 

20 In a typical two-site immunometric assay for a plurality of kallikrein polypeptides, and optionally 

CA125, one or both of the capture and detection antibodies are polyclonal antibodies or one or both of the 
capture and detection antibodies are monoclonal antibodies (i.e. polyclonal/polyclonal, 
monoclonal/monoclonal, or monoclonal/polyclonal). The labels used with the detection antibodies can be 
selected from any of those known conventionally in the art. The labels may be an enzyme or a 

25 chemiluminescent moiety, but it can also be a radioactive isotope, a fluorophor, a detectable ligand (e.g., 
detectable by a secondary binding by a labeled binding partner for the ligand), and the like. Preferably 
antibodies are labelled with enzymes which are detected by adding substrates that are selected so that a 
reaction product of the enzymes and substrates forms fluorescent complexes. Capture antibodies may be 
selected so that they provide a means for being separated from the remainder of the test mixture. 

30 Accordingly, the capture antibodies can be introduced to the assay in an already immobilized or insoluble 
form, or can be in an immobilizable form, that is, a form which enables immobilization to be accomplished 
subsequent to introduction of the capture antibodies to the assay. An immobilized capture antibody may 
comprise an antibody covalently or noncovalently attached to a solid phase such as a magnetic particle, a 
latex particle, a microtiter plate well, a bead, a cuvette, or other reaction vessel. An example of an 

35 immobilizable capture antibody is antibody which has been chemically modified with a ligand moiety, e.g., a 
hapten, biotin, or the like, and which can be subsequently immobilized by contact with an immobilized form 
of a binding partner for the ligand, e.g., an antibody, avidin, or the like. In an embodiment, a capture 
antibody may be immobilized using a species specific antibody for the capture antibody that is bound to the 
solid phase. 
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A particular sandwich immunoassay method of the invention employs two types of antibodies, first 
antibodies are reactive against each of a plurality of kallikrein polypeptides, and optionally CA125, and 
second antibodies having specificity against antibodies reactive against each of a plurality of kallikrein 
polypeptides, and optionally CA125, labelled with enzymatic labels, and fluorogenic substrates for the 
5 enzymes. An enzyme may be alkaline phosphatase (ALP) and the substrate is 5-fluorosaIicyl phosphate. 
ALP cleaves phosphate out of the fluorogenic substrate, 5-fluorosalicyl phosphate, to produce 5- 
fluorosalicylic acid (FSA). 5-Fluorosalicylic acid can then form a highly fluorescent ternary complex of the 
form FSA-Tb(3+)-EDTA, which can be quantified by measuring the Tb3+ fluorescence in a time-resolved 
mode. Fluorescence intensity is measured using a time-resolved fluorometer as described herein. 
10 The above-described immunoassay methods and formats are intended to be exemplary and are not 

limiting. 

Computer Systems 

Computer readable media comprising a plurality of kallikrein markers, and optionally CA125, is 
also provided. "Computer readable media" refers to any medium that can be read and accessed directly by a 
15 computer, including but not limited to magnetic storage media, such as floppy discs, hard disc storage 
medium, and magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM 
and ROM; and hybrids of these categories such as magnetic/optical storage media. Thus, the invention 
contemplates computer readable medium having recorded thereon markers identified for patients and 
controls. 

20 "Recorded" refers to a process for storing information on computer readable medium. The skilled 

artisan can readily adopt any of the presently known methods for recording information on computer 
readable medium to generate manufactures comprising information on a plurality of kallikrein markers, and 
optionally CA125. 

A variety of data processor programs and formats can be used to store information on a plurality of 
25 kallikrein markers, and optionally CA125, on computer readable medium. For example, the information can 
be represented in a word processing text file, formatted in commercially-available software such as 
WordPerfect and MicroSoft Word, or represented in the form of an ASCII file, stored in a database 
application, such as DB2, Sybase, Oracle, or the like. Any number of dataprocessor structuring formats (e.g., 
text file or database) may be adapted in order to obtain computer readable medium having recorded thereon 
30 the marker information. 

By providing the marker information in computer readable form, one can routinely access the 
information for a variety of purposes. For example, one skilled in the art can use the information in computer 
readable form to compare marker information obtained during or following therapy with the information 
stored within the data storage means. 
35 The invention provides a medium for holding instructions for performing a method for determining 

whether a patient has ovarian cancer or a pre-disposition to ovarian cancer, comprising determining the 
presence or absence of a plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding 
same, and based on the presence or absence of the plurality of kallikrein markers, optionally CA125, and/or 
polynucleotides encoding same, determining whether the patient has ovarian cancer or a pre-disposition to 
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ovarian cancer, and optionally recommending treatment for the ovarian cancer or p re-ovarian cancer 
condition. 

The invention also provides in an electronic system and/or in a network, a method for determining 
whether a subject has ovarian cancer or a pre-disposition to ovarian cancer associated with a plurality of 
5 kallikrein markers, and optionally CA125, and/or polynucleotides encoding same, comprising determining 
the presence or absence of a plurality of kallikrein markers, and optionally CA125, and/or polynucleotides 
encoding same, and based on the presence or absence of the plurality of kallikrein markers, and optionally 
CA125, and/or polynucleotides encoding same, determining whether the subject has ovarian cancer or a pre- 
disposition to ovarian cancer, and optionally recommending treatment for the ovarian cancer or pre-ovarian 
10 cancer condition. 

The invention further provides in a network, a method for determining whether a subject has 
ovarian cancer or a pre-disposition to ovarian cancer associated with a plurality of kallikrein markers, 
optionally CA125 and/or polynucleotides encoding same, comprising: (a) receiving phenotypic information 
on the subject and information on a plurality of kallikrein markers, optionally CA125 and/or polynucleotides 

15 encoding same associated with samples from the subject; (b) acquiring information from the network 
corresponding to the plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding 
same; and (c) based on the phenotypic information and information on the plurality of kallikrein markers, 
optionally CA125, and/or polynucleotides encoding same, determining whether the subject has ovarian 
cancer or a pre-disposition to ovarian cancer; and (d) optionally recommending treatment for the ovarian 

20 cancer or pre-ovarian cancer condition. 

The invention still further provides a system for identifying selected records mat identify an ovarian 
cancer cell. A system of the invention generally comprises a digital computer; a database server coupled to 
the computer; a database coupled to the database server having data stored therein, the data comprising 
records of data comprising a plurality of kallikrein markers, optionally CA125, and/or polynucleotides 

25 encoding same, and a code mechanism for applying queries based upon a desired selection criteria to the data 
file in the database to produce reports of records which match the desired selection criteria. 

In an aspect of the invention a method is provided for detecting an ovarian cancer cell using a 
computer having a processor, memory, display, and input/output devices, the method comprising the steps 
of: 

30 (a) creating records of a plurality of kallikrein markers, optionally CA125, and/or 

polynucleotides encoding same, isolated from a sample suspected of containing an ovarian 
cancer cell; 

(b) providing a database comprising records of data comprising a plurality of kallikrein 
markers, optionally CA125, wherein the markers are kallikrein 5, kallikrein 6, kallikrein 7, 

35 kallikrein 8, kallikrein 10, and kallikrein 11, and/or comprising polynucleotides encoding 

same; and 

(c) using a code mechanism for applying queries based upon a desired selection criteria to the 
data file in the database to produce reports of records of step (a) which provide a match of 
the desired selection criteria of the database of step (b) the presence of a match being a 
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positive indication that the markers of step (a) have been isolated from a cell that is an 
ovarian cancer cell. 

The invention contemplates a business method for determining whether a subject has ovarian cancer 
or a pre-disposition to ovarian cancer associated with a plurality of kallikrcin markers, optionally CA125, 
5 and/or polynucleotides encoding same, comprising: (a) receiving phenotypic information on the subject and 
information on a plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding same, 
associated with samples from the subject; (b) acquiring information from a network corresponding to the 
plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding same; and (c) based on 
the phenotypic information, information on a plurality of kallikrein markers, optionally CA125, and/or 
10 polynucleotides encoding same, and acquired information, determining whether the subject has ovarian 
cancer or a pre-disposition to ovarian cancer; and (d) optionally recommending treatment for the ovarian 
cancer or pre-ovarian cancer condition. 
Imaging Methods 

Antibodies specific for each of a plurality of kallikrein polypeptides, and optionally CA125, may 
15 also be used in imaging methodologies in the management of ovarian cancer. The invention provides a 
method for imaging tumors associated with a plurality of kallikrein polypeptides, and optionally CA125. 

In an embodiment the method is an in vivo method and a subject or patient is administered imaging 
agents that carry imaging labels and are capable of targeting or binding to each of a plurality of kallikrein 
polypeptides, and optionally CA125. In the method each imaging agent is labeled so that it can be 
20 distinguished during the imaging. The imaging agents are allowed to incubate in vivo and bind to the 
plurality of kallikrein polypeptides, and optionally CA125, associated with ovarian tumors. The presence of 
label is localized to the ovarian cancer, and the localized label is detected using imaging devices known to 
those skilled in the art 

The imaging agents may be antibodies or chemical entities that recognize the plurality of kallikrein 
25 polypeptides, and optionally CA125. In an aspect of the invention an imaging agent is a polyclonal antibody 
or monoclonal antibody, or fragments thereof, or constructs thereof including but not limited to, single chain 
antibodies, Afunctional antibodies, molecular recognition units, and peptides or entities that mimic peptides. 
The antibodies specific for kallikrein polypeptides and CA125 used in the methods of the invention may be 
obtained from scientific or commercial sources, or isolated native or recombinant kallikrein and CA125 
30 polypeptides may be utilized to prepare antibodies etc as described herein. 

An imaging agent may be a peptide that mimics the epitope for an antibody specific for kallikrein 
polypeptide or CA125 and binds to kallikrein polypeptide or CA125. The peptide may be produced on a 
commercial synthesizer using conventional solid phase chemistry. By way of example, a peptide may be 
prepared that includes either tyrosine, lysine, or phenylalanine to which N 2 S 2 chelate is complexed (See U.S. 
35 Patent No. 4,897,255). The anti-kallikrein peptide conjugate is then combined with a radiolabel (e.g. sodium 
99ro Tc pertechnetate or sodium l88 Re perrhenate) and it may be used to locate a tumor producing a plurality 
of kallikrein polypeptides, and optionally CA125. 

Imaging agents carry labels to image the plurality of kallikrein polypeptides and CA125. Agents 
may be labelled for use in radionuclide imaging. In particular, agents may be directly or indirectly labelled 
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with a radioisotope. Examples of radioisotopes that may be used in the present invention are the following: 
277 Ac, 2n At, ,28 Ba, m Ba, 7 Be, 204 Bi, 205 Bi, 206 Bi, 76 Br, "Br, 82 Br, I09 Cd, < 7 Ca, n C, l4 C, 36 C1, « 8 Cr, 51 Cr, 62 Cu, 
"Cu, 67 Cu, l65 Dy, 155 Eu, !8 F, ,53 Gd, « 6 Ga, 67 Ga, "Ga, 72 Ga, 198 Au, 3 H, ,66 Ho, ,n In, n3tn In, ll5m In, 123 I, ,25 I, 
l89 Ir, 191m Ir, I92 Ir, ,94 Ir, 52 Fe, 55 Fe, 59 Fe, 177 Lu, ,5 0, ,9,m - ,91 0s, !09 Pd, 32 P, 33 P, 42 K, ^Ra, lw Re, l88 Re, 82m Rb] 
5 ,53 Sm, 46 Sc, 47 Sc, "Se, 75 Se, l05 Ag, 22 Na, "Na, "Sr, 35 S, 38 S, 177 Ta, ^c, "To, " l T\, 202 T1, n3 Sn, H7ni Sn, 
,21 Sn, ,66 Yb, l69 Yb, 175 Yb, 88 Y, 90 Y, "Zn and 65 Zn. Preferably the radioisotope is m I, i25 I t ,23 I, ,n I, ""Tc, 
90 Y, l86 Re, ,88 Re, 32 P, I53 Sm, ^Ga, 20I T1 "Br, or ,8 F, and it is imaged with a photoscanning device. 

Procedures for labeling biological agents with the radioactive isotopes are generally known in the 
art. U.S. Pat. No. 4,302,438 describes tritium labeling procedures. Procedures for iodinating, tritium labeling, 
10 and 35 S labeling especially adapted for murine monoclonal antibodies are described by Goding, J. W. (supra, 
pp 124-126) and the references cited therein. Other procedures for iodinating biological agents, such as 
antibodies, binding portions thereof, probes, or ligands, are described in the scientific literature (see Hunter 
and Greenwood, Nature 144:945 (1962), David et al., Biochemistry 13:1014-1021 (1974), and U.S. Pat. Nos. 
3,867,517 and 4,376,110). Iodinating procedures for agents are described by Greenwood, F. et al., Biochem. 
15 J. 89:114-123 (1963); Marchalonis, J., Biochem. J. 113:299-305 (1969); and Morrison, M. et al, 
Immunochemistry, 289-297 (1971). " m Tc-labeling procedures are described by Rhodes, B. et al. in 
Burchiel, S. et al. (eds.), Tumor Imaging: The Radioimmunochemical Detection of Cancer, New York: 
Masson 111-123 (1982) and the references cited therein. Labelling of antibodies or fragments with 
technetium-99m are also described for example in U.S. Pat. No. 5,317,091, U.S. Pat. No. 4,478,815, U.S. 
20 Pat. No. 4,478,818, U.S. Pat. No. 4,472,371, U.S. Pat. No. Re 32,417, and U.S. Pat. No. 4,311,688. 
Procedures suitable for 1,1 In-labeling biological agents are described by Hnatowich, D. J. et al., J. Immul. 
Methods, 65:147-157 (1983), Hnatowich, D. et al., J. Applied Radiation, 35:554-557 (1984), and Buckley, 
R. G. et al., F.B.B.S. 166:202-204 (1984). 

An imaging agent may also be labeled with a paramagnetic isotope for purposes of an in vivo 
25 method of the invention. Examples of elements that are useful in magnetic resonance imaging include 
gadolinium, terbium, tin, iron, or isotopes thereof. (See, for example, Schaefer et al., (1989) JACC 14, 472- 
480; Shreve et al., (1986) Magn. Reson. Med. 3, 336-340; Wolf, G L., (1984) Physiol. Chem. Phys. Med. 
NMR 16, 93-95; Wesbey et al., (1984) Physiol. Chem. Phys. Med. NMR 16, 145-155; Runge et al., (1984) 
Invest. Radiol. 19, 408-415 for discussions on in vivo nuclear magnetic resonance imaging.) 
30 m the case of radiolabeled agents, the agents may be administered to the patient, localized to the 

tumor having a plurality of kallikrein polypeptides, and optionally CA125, with which the agents bind, and 
detected or "imaged" in vivo using known techniques such as radionuclear scanning using, for example, a 
gamma camera or emission tomography. [See for example, A. R. Bradwell et al., "Developments in 
Antibody Imaging", Monoclonal Antibodies for Cancer Detection and Therapy, R. W. Baldwin et al., (eds.), 
35 pp. 65-85 (Academic Press 1985)]. A positron emission transaxial tomography scanner, such as designated 
Pet VI located at Brookhaven National Laboratory, can also be used where the radiolabel emits positrons 
(e.g., u C, 18 F, 15 O, and 13 N). 

Whole body imaging techniques using radioisotope labeled agents can be used for locating both 
primary tumors and tumors which have metastasized. Antibodies specific for a plurality of kallikrein 
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polypeptides, and optionally CA125, or fragments thereof having the same epitope specificity, are bound to a 
suitable radioisotope, or a combination thereof, and administered parenterally. For ovarian cancer, 
administration preferably is intravenous. The bio-distribution of the labels can be monitored by scintigraphy, 
and accumulations of the labels can be related to the presence of ovarian cancer cells. Whole body imaging 
5 techniques are described in U.S. Pat. Nos. 4,036,945 and 4,311,688. Other examples of agents useful for 
diagnosis and therapeutic use that can be coupled to antibodies and antibody fragments include 
metallothionein and fragments (see, U.S. Pat No. 4,732,864). These agents are useful in diagnosis, staging 
and visualization of cancer, in particular ovarian cancer, so that surgical and/or radiation treatment protocols 
can be used more efficiently. 

1 0 Screening Methods 

The invention also contemplates methods for evaluating test agents or compounds for their ability to 
inhibit ovarian cancer or potentially contribute to ovarian cancer. Test agents and compounds include but 
are not limited to peptides such as soluble peptides including Ig-tailed fusion peptides, members of random 
peptide libraries and combinatorial chemistry-derived molecular libraries made of D- and/or L-configuration 

15 amino acids, phosphopeptides (including members of random or partially degenerate, directed 
phosphopeptide libraries), antibodies [e.g. polyclonal, monoclonal, humanized, anti-idiotypic, chimeric, 
single chain antibodies, fragments, (e.g. Fab, F(ab)2, and Fab expression library fragments, and epitope- 
binding fragments thereof)], nucleic acids (e.g. antisense, interference RNA) and small organic or inorganic 
molecules. The agents or compounds may be endogenous physiological compounds or natural or synthetic 

20 compounds. 

The invention also provides a method for assessing the potential efficacy of a test agent for 
inhibiting ovarian cancer in a patient, the method comprising comparing: 

(a) levels of a plurality of kallikrein markers, optionally CA125, and/or polynucleotides 
encoding same, in a first sample obtained from a patient and exposed to the test agent, 

25 wherein the markers comprise or are selected from the group consisting of kallikrein 5, 

kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11, and 

(b) levels of the plurality of kallikrein markers, optionally CA125, and/or polynucleotides 
encoding same, in a second sample obtained from the patient, wherein the sample is not 
exposed to the test agent, wherein a significant difference in the levels of expression of a 

30 plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding same, 

in the first sample, relative to the second sample, is an indication that the test agent is 
potentially efficacious for inhibiting ovarian cancer in the patient. 
The first and second samples may be portions of a single sample obtained from a patient or portions 
of pooled samples obtained from a patient 
35 In an aspect, the invention provides a method of selecting an agent for inhibiting ovarian cancer in a 

patient comprising: 

(a) obtaining a sample comprising cancer cells from the patient; 

(b) separately maintaining aliquots of the sample in the presence of a plurality of test agents; 

(c) comparing a plurality of kallikrein markers, optionally CA125, and/or polynucleotides 
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encoding same, in each of the aliquots, wherein the markers comprise or are selected from 
the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, 
and kallikrein 11; and 

(d) selecting one of the test agents which alters the levels of the kallikrein markers, optionally 
5 CA125, and/or polynucleotides encoding same, in the aliquot containing that test agent, 

relative to other test agents. 
Still another aspect of the present invention provides a method of conducting a drug discovery 
business comprising: 

(a) providing one or more methods or assay systems for identifying agents that inhibit ovarian 
10 cancer in a patient; 

(b) conducting therapeutic profiling of agents identified in step (a), or further analogs thereof, 
for efficacy and toxicity in animals; and 

(c) formulating a pharmaceutical preparation including one or more agents identified in step 
(b) as having an acceptable therapeutic profile. 

15 In certain embodiments, the subject method can also include a step of establishing a distribution 

system for distributing the pharmaceutical preparation for sale, and may optionally include establishing a 
sales group for marketing the pharmaceutical preparation. 

The invention also contemplates a method of assessing the ovarian carcinogenic potential of a test 
compound comprising: 

20 (a) maintaining separate aliquots of ovarian cells in the presence and absence of the test 

compound; and 

(b) comparing a plurality of kallikrein markers, optionally CA125, and/or polynucleotides 
encoding same, in each of the aliquots, wherein the markers comprise or are selected from 
the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, 
25 and kallikrein 11. 

A significant difference between the levels of the markers in the aliquot maintained in the presence 
of (or exposed to) the test compound relative to the aliquot maintained in the absence of the test compound, 
indicates that the test compound possesses ovarian carcinogenic potential. 
Kits 

30 The methods described herein may be performed by utilizing pre-packaged diagnostic kits 

comprising at least a plurality of kallikrein nucleic acids or binding agents (e.g. antibodies) or CA125 nucleic 
acids or binding agents described herein, which may be conveniently used, e.g., in clinical settings, to screen 
and diagnose patients, and to screen and identify those individuals afflicted with or exhibiting a 
predisposition to ovarian cancer. 

35 Thus, the invention also contemplates kits for carrying out the methods of the invention. Such kits 

typically comprise two or more components required for performing a diagnostic assay. Components include 
but are not limited to compounds, reagents, containers, and/or equipment. 

In an embodiment, a container with a kit comprises binding agents as described herein. By way of 
example, the kit may contain antibodies specific for a plurality of kallikrein polypeptides, and optionally 
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CA125, antibodies against the antibodies labelled with enzymes; and substrates for the enzymes. The kit may 
also contain microtiter plate wells, standards, assay diluent, wash buffer, adhesive plate covers, and/or 
instructions for carrying out a method of the invention using the kit 

In an aspect of the invention, the kit includes antibodies or antibody fragments which bind 
5 specifically to epitopes of each of a plurality of kallikrein polypeptides, and optionally CA125, and means 
for detecting binding of the antibodies to epitopes associated with tumor cells, either as concentrates 
(including lyophilized compositions), which may be further diluted prior to use or at the concentration of 
use, where the vials may include one or more dosages. Where the kits are intended for in vivo use, single 
dosages may be provided in sterilized containers, having the desired amount and concentration of agents. 

10 Containers that provide a formulation for direct use, usually do not require other reagents, as for example, 
where the kit contains radiolabeled antibody preparations for in vivo imaging. 

A kit may be designed to detect the level of polynucleotides encoding kallikrein polypeptides, and 
optionally CA125 polynucleotides, in a sample. Such kits generally comprise oligonucleotide probes or 
primers, as described herein, that hybridize to a plurality of polynucleotides encoding kallikrein polypeptides 

15 and optionally CA125. Such oligonucleotides may be used, for example, within a PCR or hybridization 
procedure. Additional components that may be present within the kits include second oligonucleotides and/or 
diagnostic reagents to facilitate detection of a plurality polynucleotides encoding kallikrein polypeptides, and 
optionally CA125 polynucleotides. 

The reagents suitable for applying the screening methods of the invention to evaluate compounds 

20 may be packaged into convenient kits described herein providing the necessary materials packaged into 
suitable containers. 
Applications 

Kallikrein polypeptides (in particular, kallikrein 5, 6, 10 and 11), optionally in combination with 
CA125, are targets for ovarian cancer immunotherapy. Such immunotherapeutic methods include the use of 

25 antibody therapy, in vivo vaccines, and ex vivo immunotherapy approaches. 

In one aspect, the invention provides antibodies specific for a plurality of kallikrein polypeptides 
(for example, kallikreins 5, 6, 10 and 11) and optionally CA125, that may be used systemically to treat 
ovarian cancer. Preferably antibodies are used that target the tumor cells but not the surrounding non-tumor 
cells and tissue. Thus, the invention provides a method of treating a patient susceptible to, or having a cancer 

30 that expresses a plurality of kallikrein polypeptides, and optionally CA125, comprising administering to the 
patient an effective amount of antibodies that bind specifically to a plurality of kallikrein polypeptides, and 
optionally CA125. In another aspect, the invention provides a method of inhibiting the growth of tumor cells 
expressing a plurality of kallikrein polypeptides, and optionally CA125, comprising administering to a 
patient antibodies which bind specifically to the plurality of kallikrein polypeptides, and optionally CA125, 

35 in amounts effective to inhibit growth of the tumor cells. Antibodies specific for a plurality of kallikrein 
polypeptides, and optionally CA125, may also be used in a method for selectively inhibiting the growth of, 
or killing a cell expressing a plurality of kallikrein polypeptides, and optionally CA125, comprising reacting 
antibody immunoconjugates or immunotoxins with the cell in an amount sufficient to inhibit the growth of, 
or kill the cell. 
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By way of example, unconjugated antibodies specific for a plurality of kallikrein polypeptides, and 
optionally CA125, may be introduced into a patient such that the antibodies bind to cancer cells expressing a 
plurality of kallikrein polypeptides, and optionally CA125, and mediate growth inhibition of such cells 
(including the destruction thereof), and the tumor, by mechanisms which may include complement-mediated 
5 cytolysis, antibody-dependent cellular cytotoxicity, altering the physiologic function of a plurality of 
kallikrein polypeptides, and optionally CA125, and/or the inhibition of ligand binding or signal transduction 
pathways. In addition to unconjugated antibodies, antibodies specific for a plurality of kallikrein 
polypeptides, and optionally CA125, conjugated to therapeutic agents (e.g. immunoconjugates) may also be 
used therapeutically to deliver the agents directly to tumor cells expressing a plurality of kallikrein 

10 polypeptides, and optionally CA125, and thereby destroy the tumor. Examples of such agents include abrin, 
ricin A, Pseudomonas exotoxin, or diphtheria toxin, proteins such as tumor necrosis factor, alpha-interferon, 
beta-interferon, nerve growth factor, platelet derived growth factor, tissue plasminogen activator, and 
biological response modifiers such as lymphokines, interleukin-1, interleukin-2, interleukin-6, granulocyte 
macrophage colony stimulating factor, granulocyte colony stimulating factor, or other growth factors. 

15 Cancer immunotherapy using antibodies specific for a plurality of kallikrein polypeptides, and 

optionally CA125, may utilize the various approaches that have been successfully employed for cancers, 
including but not limited to colon cancer (Arlen et al., 1998, Crit Rev Immunol 18: 133-138), multiple 
myeloma (Ozaki et al., 1997, Blood 90: 3179-3186; Tsunenati et al., 1997, Blood 90: 2437-2444), gastric 
cancer (Kasprzyk et al., 1992, Cancer Res 52: 2771-2776), B-cell lymphoma (Funakoshi et al., 1996, J 

20 Immunther Emphasis Tumor Immunol 19: 93-101), leukemia (Zhong et al, 1996, Leuk Res 20: 581-589), 
colorectal cancer (Moun et al., 1994, Cancer Res 54: 6160-6166); Velders et al., 1995, Cancer Res 55: 4398- 
4403), and breast cancer (Shepard et al, 1991, J Clin Immunol 1 1: 117-127). 

In the practice of a method of the invention, antibodies specific for a plurality of kallikrein 
polypeptides, optionally in combination with antibodies specific for CA125, capable of inhibiting the growth 

25 of cancer cells expressing a plurality of kallikrein polypeptides, and optionally CA125, are administered in a 
therapeutically effective amount to cancer patients whose tumors express or overexpress a plurality of 
kallikrein polypeptides, and optionally CA125. The invention may provide a specific, effective and long- 
needed treatment for ovarian cancer. The antibody therapy methods of the invention may be combined with 
other therapies including chemotherapy and radiation. 

30 Patients may be evaluated for the presence and levels of a plurality of kallikrein polypeptides, and 

optionally CA125, expression and overexpression in tumors, preferably using immunohistochemical 
assessments of tumor tissue, quantitative imaging as described herein, or other techniques capable of reliably 
indicating the presence and degree of expression of a plurality of kallikrein polypeptides, and optionally 
CA125. Immunohistochemical analysis of tumor biopsies or surgical specimens may be employed for this 

35 purpose. 

Antibodies specific for a plurality of kallikrein polypeptides and CA125 useful in treating cancer 
include those that are capable of initiating a potent immune response against the tumor and those that are 
capable of direct cytotoxicity. In this regard, the antibodies may elicit tumor cell lysis by either complement- 
mediated or antibody-dependent cell cytotoxicity (ADCC) mechanisms, both of which require an intact Fc 
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portion of the immunoglobulin molecule for interaction with effector cell Fc receptor sites or complement 
proteins. In addition, antibodies specific for a plurality of kallikrein polypeptides and CA125 that exert a 
direct biological effect on tumor growth are useful in the practice of the invention. Such antibodies may not 
require the complete immunoglobulin to exert the effect Potential mechanisms by which such directly 

5 cytotoxic antibodies may act include inhibition of cell growth, modulation of cellular differentiation, 
modulation of tumor angiogenesis factor profiles, and the induction of apoptosis. The mechanism by which a 
particular antibody exerts an anti-tumor effect may be evaluated using any number of in vitro assays 
designed to determine ADCC, antibody-dependent macrophage-mediated cytotoxicity (ADMMC), 
complement-mediated cell lysis, and others known in the art. 

10 The anti-tumor activity of a combination of antibodies specific for a plurality of kallikrein 

polypeptides and optionally CA125, may be evaluated in vivo using a suitable animal model. Xenogenic 
cancer models, wherein human cancer explants or passaged xenograft tissues are introduced into immune 
compromised animals, such as nude or SCID mice, may be employed. 

The methods of the invention contemplate the admimstration of combinations, or "cocktails" of 

15 different individual antibodies recognizing epitopes of a plurality of kallikrein polypeptides, and optionally" 
CA125. Such cocktails may have certain advantages inasmuch as they contain antibodies that bind to 
different epitopes and/or exploit different effector mechanisms or combine directly cytotoxic antibodies with 
antibodies that rely on immune effector functionality. Such antibodies in combination may exhibit 
synergistic therapeutic effects. In addition, the administration of the antibodies may be combined with other 

20 therapeutic agents, including but not limited to chemotherapeutic agents, androgen-blockers, and immune 
modulators (e.g., IL2, GM-CSF). The antibodies may be administered in their "naked" or unconjugated form, 
or may have therapeutic agents conjugated to them. 

The antibodies specific for a plurality of kallikrein polypeptides and optionally CA125, used in the 
practice of the method of the invention may be formulated into pharmaceutical compositions comprising a 

25 carrier suitable for the desired delivery method. Suitable carriers include any material which when combined 
with the antibodies retains the anti-tumor function of the antibodies and is non-reactive with the subject's 
immune systems. Examples include any of a number of standard pharmaceutical carriers such as sterile 
phosphate buffered saline solutions, bacteriostatic water, and the like (see, generally, Remington's 
Pharmaceutical Sciences 16.sup.th Edition, A. Osal., Ed., 1980). 

30 Antibody formulations may be administered via any route capable of delivering the antibodies to 

the tumor site. Routes of administration include, but are not limited to, intravenous, intraperitoneal, 
intramuscular, intratumor, intradermal, and the like. Preferably, the route of administration is by intravenous 
injection. Antibody preparations may be lyophilized and stored as a sterile powder, preferably under 
vacuum, and then reconstituted in bacteriostatic water containing, for example, benzyl alcohol preservative, 

35 or in sterile water prior to injection. 

Treatment will generally involve the repeated administration of the antibody preparation via an 
acceptable route of administration such as intravenous injection (IV), at an effective dose. Dosages will 
depend upon various factors generally appreciated by those of skill in the art, including the type of cancer 
and the severity, grade, or stage of the cancer, the binding affinity and half life of the antibodies used, the 
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degree of expression of a plurality of kallikrein polypeptides, and optionaUy CA125, in the patient, the extent 
of circulating kallikrein polypeptide antigens, and optionally CA125 antigens, the desired steady-state 
antibody concentration level, frequency of treatment, and the influence of any chemotherapeutic agents used 
in combination with a treatment method of the invention. 
5 Daily doses may range from about 0.1 to 100 mg/kg. Doses in the range of 10-500 mg antibodies 

per week may be effective and well tolerated, although even higher weekly doses may be appropriate and/or 
well tolerated. A determining factor in defining the appropriate dose is the amount of antibodies necessary to 
be therapeutically effective in a particular context Repeated administrations may be required to achieve 
tumor inhibition or regression. Direct administration of antibodies specific for a plurality of kallikrein 

10 polypeptides and optionally CA125 is also possible and may have advantages in certain situations. 

Patients may be evaluated for a plurality of kallikrein polypeptides and optionally CA125, 
preferably in serum, in order to assist in the determination of the most effective dosing regimen and related 
factors. The assay methods described herein, or similar assays, may be used for quantitating circulating 
kallikrein polypeptide and optionally CA125 levels in patients prior to treatment Such assays may also be 

15 used for monitoring throughout therapy, and may be useful to gauge therapeutic success in combination with 
evaluating other parameters, such as serum kallikrein polypeptides, and optionally CA125, levels. 

The invention further provides vaccines formulated to contain a plurality of kallikrein polypeptides, 
and optionally CA125, or fragments thereof. The use in anti-cancer therapy of tumor antigens in a vaccine 
for generating humoral and cell-mediated immunity is well known and, for example, has been employed in 

20 prostate cancer using human PSMA and rodent PAP immunogens (Hodge et al., 1 995, Int. J. Cancer 63: 231- 
237; Fong et al., 1997, J. Immunol. 159: 3113-3117). These methods can be practiced by employing a 
plurality of kallikrein polypeptides, and optionally CA125, or fragments thereof, or nucleic acids and 
recombinant vectors capable of expressing and appropriately presenting the kallikrein and optionally CA125, 
immunogens. 

25 By way of example, viral gene delivery systems may be used to deliver nucleic acids encoding a 

plurality of kallikrein polypeptides, and optionally CA125. Various viral gene delivery systems which can be 
used in the practice of this aspect of the invention include, but are not limited to, vaccinia, fowlpox, 
canarypox, adenovirus, influenza, poliovirus, adeno-associated virus, lentivirus, and sindbus virus (Restifo, 
1996, Curr. Opin. Immunol. 8: 658-663). Non- viral delivery systems may also be employed by using naked 

30 DNA encoding a plurality of kallikrein polypeptides, and optionally CA1 25, or fragments thereof introduced 
into the patient (e.g., intramuscularly) to induce an anti-tumor response. 

Various ex vivo strategies may also be employed. One approach involves the use of cells to present 
kallikrein and optionally CA125 antigens to a patient's immune system. For example, autologous dendritic 
cells which express MHC class I and II, may be pulsed with a plurality of kallikrein polypeptides, and 

35 optionally CA125, or peptides thereof that are capable of binding to MHC molecules, to thereby stimulate 
ovarian cancer patients' immune systems (See, for example, Tjoa et ah, 1996, Prostate 28: 65-69; Murphy et 
at, 1996, Prostate 29: 371-380). 

Anti-idiotypic antibodies specific for a plurality of kallikrein polypeptides, and optionally CA125, 
can also be used in anti-cancer therapy as a vaccine for inducing an immune response to cells expressing the 
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polypeptides. The generation of anti-idiotypic antibodies is well known in the art and can readily be adapted 
to generate anti-idiotypic antibodies that mimic an epitope on a kallikrein polypeptide or CA125 (see, for 
example, Wagner et al., 1997, Hybridoma 16: 33-40; Foon et al., 1995, J Clin Invest 96: 334-342; Herlyn et 
ah, 1996, Cancer Immunol Immunother 43: 65-76). Such antibodies can be used in anti-idiotypic therapy as 
5 presently practiced with other anti-idiotypic antibodies directed against tumor antigens. 

Genetic immunization methods may be utilized to generate prophylactic or therapeutic humoral and 
cellular immune responses directed against cancer cells expressing a plurality of kallikrein polypeptides, and 
optionally CA125. Constructs comprising DNA encoding kallikrein and optionally CA125 
polypeptides/immunogens and appropriate regulatory sequences may be injected directly into muscle or skin 

10 of an individual, such that the cells of the muscle or skin take-up the construct and express the encoded 
kallikrein or CA125 polypeptides/immunogens. The polypeptides/immunogens may be expressed as cell 
surface proteins or be secreted. Expression of the polypeptides/immunogens results in the generation of 
prophylactic or therapeutic humoral and cellular immunity against the cancer. Various prophylactic and 
therapeutic genetic immunization techniques known in the art may be used. 

15 The invention further provides methods for inhibiting cellular activity (e.g., cell proliferation, 

activation, or propagation) of a cell expressing a plurality of kallikrein polypeptides, and optionally CA125. 
This method comprises reacting immunoconjugates of the invention (e.g., a heterogeneous or homogenous 
mixture) with the cell so that the kallikrein polypeptides, and optionally CA125, form complexes with the 
immunoconjugates. A subject with a neoplastic or preneoplastic condition can be treated when the inhibition 

20 of cellular activity results in cell death. 

In another aspect, the invention provides methods for selectively inhibiting a cell expressing a 
plurality of kallikrein polypeptides, and optionally CA125, by reacting a combination of immunoconjugates 
of the invention with the cell in an amount sufficient to inhibit the cell. Amounts include those that are 
sufficient to kill the cell or sufficient to inhibit cell growth or proliferation. 

25 Vectors derived from retroviruses, adenovirus, herpes or vaccinia viruses, or from various bacterial 

plasmids, may be used to deliver nucleic acids encoding a plurality of kallikrein polypeptides, and optionally 
CA125, to a targeted organ, tissue, or cell population. Methods well known to those skilled in the art may be 
used to construct recombinant vectors that will express antisense nucleic acid molecules for kallikrein 
polypeptides and CA125. (See, for example, the techniques described in Sambrook et al (supra) and Ausubel 

30 et al (supra)). 

Genes encoding a plurality of kallikrein polypeptides, and optionally CA125, can be turned off by 
transfecting a cell or tissue with vectors that express high levels of a desired kallikrein or CA125 
polypeptide-encoding fragments. Such constructs can inundate cells with untranslatable sense or antisense 
sequences. Even in the absence of integration into the DNA, such vectors may continue to transcribe RNA 
35 molecules until all copies are disabled by endogenous nucleases. 

Modifications of gene expression can be obtained by designing antisense molecules, DNA, RNA or 
PNA, to the regulatory regions of genes encoding kallikrein polypeptides, and optionally CA125, i.e., the 
promoters, enhancers, and introns. Preferably, oligonucleotides are derived from the transcription initiation 
site, eg, between -10 and +10 regions of the leader sequence. The antisense molecules may also be designed 
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so that they block translation of mRNA by preventing the transcript from binding to ribosornes. Inhibition 
may also be achieved using "triple helix" base-pairing methodology. Triple helix pairing compromises the 
ability of the double helix to open sufficiently for the binding of polymerases, transcription factors, or 
regulatory molecules. Therapeutic advances using triplex DNA were reviewed by Gee J E et al (In: Huber B 

5 E and B I Carr (1994) Molecular and Immunologic Approaches, Futura Publishing Co, Mt Kisco N.Y.). 

Ribozymes are enzymatic RNA molecules that catalyze the specific cleavage of RNA. Ribozymes 
act by sequence-specific hybridization of the ribozyme molecule to complementary target RNA, followed by 
endonucleolytic cleavage. The invention therefore contemplates engineered hammerhead motif ribozyme 
molecules that can specifically and efficiently catalyze endonucleolytic cleavage of sequences encoding a 

10 plurality of kallikrein polypeptides, and optionally CA125. 

Specific ribozyme cleavage sites within any potential RNA target may initially be identified by 
scanning the target molecule for ribozyme cleavage sites which include the following sequences, GUA, 
GUU and GUC. Once the sites are identified, short RNA sequences of between 15 and 20 ribonucleotides 
corresponding to the region of the target gene containing the cleavage site may be evaluated for secondary 

15 structural features which may render the oligonucleotide inoperable. The suitability of candidate targets may 
also be determined by testing accessibility to hybridization with complementary oligonucleotides using 
ribonuclease protection assays. 

Methods for introducing vectors into cells or tissues include those methods discussed herein and 
which are suitable for in vivo, in vitro and ex vivo therapy. For ex vivo therapy, vectors may be introduced 

20 into stem cells obtained from a patient and clonally propagated for autologous transplant into the same 
patient (See U.S. PaL Nos. 5,399,493 and 5,437,994). Delivery by transfection and by liposome are well 
known in the art 

Kallikrein polypeptides, optionally CA125 polypeptide, and/or polynucleotides encoding the 
polypeptides, and fragments thereof, antibodies and/or agents identified using a method of the invention, or 

25 combinations thereof, may be used in the treatment of ovarian cancer or diseases, conditions or syndromes 
associated with ovarian cancer, in a subject. A combination of kallikrein polypeptides and/or 
polynucleotides encoding the kallikreins (e.g. kallikreins 7 and 8) and inhibitors (antisense, antibodies, or 
agents) of other kallikreins (e.g. kallikreins 5, 6, 10 and 11) and/or CA125 may be used in a prognostic or 
therapeutic method of the invention. The polypeptides, polynucleotides, and agents may be formulated into 

30 compositions for administration to subjects suffering from ovarian cancer. Therefore, the present invention 
also relates to a composition comprising a plurality of kallikrein polypeptides and optionally CA125, or 
nucleic acids encoding the polypeptides, or a fragment thereof, or an agent identified using a method of the 
invention, and a pharmaceutically acceptable carrier, excipient or diluent A method for treating or 
preventing ovarian cancer in a subject is also provided comprising administering to a patient in need thereof, 

35 a plurality of kallikrein polypeptides and optionally CA125, or nucleic acids encoding the polypeptides, an 
agent identified in accordance with a method of the invention, and/or a composition of the invention. 

The active substance may be administered in a convenient manner such as by injection 
(subcutaneous, intravenous, etc.), oral administration, inhalation, transdermal application, or rectal 
administration. Depending on the route of administration, the active substance may be coated in a material to 
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protect the substance from the action of enzymes, acids and other natural conditions that may inactivate the 
substance. 

The compositions described herein can be prepared by per se known methods for the preparation of 
pharmaceuticaliy acceptable compositions which can be administered to subjects, such that an effective 
5 quantity of the active substance is combined in a mixture with a pharmaceuticaliy acceptable vehicle. 
Suitable vehicles are described, for example, in Remington's Pharmaceutical Sciences (Remington's 
Pharmaceutical Sciences, Mack Publishing Company, Easton, Pa., USA 1985). On this basis, the 
compositions include, albeit not exclusively, solutions of the active substances in association with one or 
more pharmaceuticaliy acceptable vehicles or diluents, and contained in buffered solutions with a suitable 
10 pH and iso-osmotic with the physiological fluids. 

The compositions are indicated as therapeutic agents either alone or in conjunction with other 
therapeutic agents or other forms of treatment (e.g. chemotherapy or radiotherapy). The compositions of the 
invention may be administered concurrently, separately, or sequentially with other therapeutic agents or 
therapies. 

15 The following non-limiting examples are illustrative of the present invention: 

Example 1 

To investigate the additional discriminatory value of the kallikreins to CA125 a logistic regression 
model was developed Included in the study were serum samples from 39 ovarian cancer patients and 194 
non-cancer females. The age of the patients was as follows: median = 59, range 32-82. The age of the 

20 controls was as follows: median = 46; range = 22-77. The model was adjusted for the following variables: 
f(x) - -0.29 hK5* +0.12* hK6-0.65*hK7-0.6*hK8+1.09*^ For these 

data, the crude odds ratio and the 95% confidence interval were found to be 2.71 and 1.91-3.84 (p<0.001). 
The log likelihood scores for this multivariate logistic regression model, which incorporated the combined 
variables for each patient was calculated. From these data, by picking different thresholds for the regression 

25 function values, a ROC curve was devised which shows the added value of using kallikreins and CA125 
together in a multivariate function.( AUC, 0.99 ;95%CI,0.96- 1.00). (See Figure 8.) Statistically significant 
correlations between age and other studied variables were not observed. 
Example 2 

Statistically significant differences in serum kallikrein concentration was found between patient and 
30 control subjects for kallikreins hK5 (p<0.0001), hK7 (p=0.007), hK8 (p=0.005), hKlO (p=0.0003) and 
CA125 (p<0.0001) by the Mann-Whitney test The diagnostic sensitivity (SENS) and specificity (SPEC) for 
each one of these markers were as follows (SBNS/SPEC; both as %): 31/95 (hK5); 62/71 (hK7); 62/70 
(hK10); 54/54 (hKll); 89/94 (CA125). When these data were combined in a logistic regression model, 
kallikreins 5 and 10 did not contribute to a great extent to the sensitivity and specificity of CA125. The area 
35 under the curve of CA125 alone (93%) improved by a further 1% when adding hK6, by 2% when adding 
hKll, 3% when adding hK7 and 5% when adding hK8. The combination of CA125 and hK8 resulted in an 
AUC of 98%. 

Below is a summary of each marker and its ability to separate the cases and controls. 
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hK5: high values associated with cancer 
test* is hK5>0.10, test- is hK5<=0.10 
sensitivity=31%, specificity=95%, 
AUO.62, p(AUC)=.02 
5 Wilcoxon rank sum test has p<.0001 . 

Of the 233 persons analyzed, 207 have value zero for hK5 (27 cases, 180 controls). 
Possible good marker 

hK6: high values associated with cancer 
10 testf is hiC6>6.3, test- is hK6<=6.3 
sensitivity=69%, specificity=40%, 
AUO.50, p(AUQ=1.00 
Wilcoxon rank sum test has p=.91. 
Not a good marker 

15 

hK7: low values associated with cancer 
testf is hK7<2.05, test- is hK7>=2.05 
sensitivity=62%, specificity=71%, 
AUO.64, p(AUC)=.006 
20 Wilcoxon rank sum test has p=.007. 
Possible good marker 

hK8: low values associated with cancer 
testf is hK8<13.0, test- is hK8>=13.0 
25 sensitivity=72%, specificity=42%, 
AUO=.64, p(AUQ=.006 
Wilcoxon rank sum test has p=.005 
Possible good marker 

30 hK10: high values associated with cancer 

testf is hK10>1.42, test- is hK10<=1.42 

sensitivity=62%, specificity=70%, 

AUO.68, p(AUC)=.0004 

Wilcoxon rank sum test has p=.0003. 
35 Best single kallikrein marker 

hKl 1 : high values associated with cancer 
testf is hKll>0.14, test- is hKll<=0.14 
sensitivity ss 54%, specificity=54%, 
40 AUC=.58, p(AUQ=. 12 

Wilcoxon rank sum test has p=. 1 1. 
Not a good marker 

CA125: high values associated with cancer 
45 testf is Ca 125>34, test- is Ca 125<=34 
sensitivity=89%, specificity=94%, 
AUO.933, p(AUQ=<0001 
Wilcoxon rank sum test has p<.0001. 
Good marker 

50 

After some further multivariate analysis of only the kallikrein markers, the combination of hK 7, 8, 10 and 
1 1 was a preferred set. This combination was arrived at by looking at the incremental AUC as markers were 
combined. Below is a summary of all the models tried: 



55 



hK10 alone, AUO.68 
hK10fhK7: AUC-.88 
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hK10+hK7+hK8: AUO.90 
hK10+hK7+hK8+hKl 1 : AUC=925 

Multivariate model of hK7, hK8, hK10, hKll, call it hK7_8_10_l 1 

5 hK7_8_110_U: 

Calculate SA=2.00-1.49(hK7)-.34(hK8)+L16(hK10)+3.50(hKl 1) 
high values associated with cancer 
test* is SA>-1.15, test- is SA<=-1.15 
sensitivity=87% l specificity=89%, 
10 AUO.93, p(AUQ=<.0001 

Wilcoxon rank sum test has p<0001. 
Good marker 

The hK marker that added the most to CA125 was also investigated 
15 CA125 alone, AUO.933 
CA125+hK8: AUO.978 

Multivariate model of Cal25, hK8, call it Cal25JiK8 



Cai25_hK8: 

20 SO-1.71+.086(Cal25)-.47(hK8). 
high values associated with cancer 
test+ is SO-2.52, test- is SC<-2.52 
sensitivity=97%, specificity=90%, 
AUO.978, p(AUC)=<0001 

25 Wilcoxon rank sum test has p<. 000 1 . 
Good marker 



Below is a summary of the above analyses: 
a) The preferred kallikrien marker alone is hK10, AUO.68 
30 b) CA125hasanAUCof.933 

c) The preferred combination of kallikrein markers increases the AUC up to ,925, which is close to the 
CA125 AUC of .933 

d) Adding a kallikrein marker to CA125 can increase the AUC up to .978 



35 How does CA125 alone compare with the multivariate kallikrein model hK7_8_10_l 1? 
(based on 39 cases and 186 controls evaluated with CA125) 
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CA125 
hK7_8_10_U 
both positive 
either positive 



Sensitivity Specificity misclassification 

90% 94% 12FP, 4FN, total 16 pts misclassified 

85% 89% 3 1FP, 4FN, total 35 pts misclassified 

77% 100% 0FP, 9FN, total 9 pts misclassified 

97% 82% 33FP, 1FN, total 34 pts misclassified 



How does CA125 alone compare with the multivariate model of CA125 plus hK8? 

45 (based on 39 cases and 186 controls evaluated with CA125) 

Sensitivity Specificity misclassification 
CA125 90% 94% 12FP, 4FN, total 16 pts misclassified 

CA125JiK8 95% 91% 17FP, 2FN, total 19 pts misclassified 
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Kallikrein markers approach CA125 in terms of AUC and sensitivity, but their specificity is not as 
high. Adding hK8 to CA125 improves sensitivity but specificity is lower than CA125 alone. 
Summary 

5 a) The best kallikrein marker alone is hKl 0, area under the ROC curve (AUC) =.68. 

b) CA125 has an AUC of .933. Adding a single kallikrein marker to CA125 can get the AUC 
up to .978. Adding hK8 to CA125 improves sensitivity but specificity is lower compared 
with CA125 alone. 

c) The best combination of kallikrein markers gets the AUC up to .925, which is close to the 
10 CA125 AUC of .933. Kallikrein markers approach CA125 in terms of AUC and 

sensitivity, but their specificity is lower. 



The present invention is not to be limited in scope by the specific embodiments described herein, 

15 since such embodiments are intended as but single illustrations of one aspect of the invention and any 
functionally equivalent embodiments are within the scope of this invention. Indeed, various modifications of 
the invention in addition to those shown and described herein will become apparent to those skilled in the art 
from the foregoing description and accompanying drawings. Such modifications are intended to fall within 
the scope of the appended claims. 

20 All publications, patents and patent applications referred to herein are incorporated by reference in 

their entirety to the same extent as if each individual publication, patent or patent application was 
specifically and individually indicated to be incorporated by reference in its entirety. All publications, 
patents and patent applications mentioned herein are incorporated herein by reference for the purpose of 
describing and disclosing the domains, cell lines, vectors, methodologies etc. which are reported therein 

25 which might be used in connection with the invention. Nothing herein is to be construed as an admission that 
the invention is not entitled to antedate such disclosure by virtue of prior invention. 

It must be noted that as used herein and in the appended claims, the singular forms "a", "an", and 
"the" include plural reference unless the context clearly dictates otherwise. Thus, for example, reference to "a 
host cell" includes a plurality of such host cells, reference to the "antibody" is a reference to one or more 

30 antibodies and equivalents thereof known to those skilled in the art, and so forth. 

Below full citations are set out for the references referred to in the specification. 
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Table 1 



KaUikrein Polypeptide 


Kallikrein Nucleic Acid 
Designation 


GenBank Accession No. 


Kallikrein 5 


KLK5 


AAD26429, AF135028, AF168768 


Kallikrein 6 


KLK6 


AAB66483, AF013988 (CDS 174..881), 
AF149289 (CDS join 3567..3606, 4346..4502, 
8122..8369, 979 1.. 9927, 11 805. .11 957) U62801 
(CDS 246..980) 


Kallikrein 7 


KLK7 


AAC37551, L33404 (CDS 16..777), AF166330 
(CDS join 3237.3309, 3722..3869, 4566..4813, 
5129..5265, 7362..7517) (mRNA join( 1756.. 1785, 
3179.3309, 3722.3869, 4566..4813, 5129..5265, 
73 62.. 8265) /product="stratum corneum 
chymotryptic enzyme" /note= M alternatively 
spliced" ; mRNA join (1756.. 1785, 3179..3309, 
3722.3869, 4566..4813, 5129..5265, 7362..7991) 
/note="alternatively spliced"; mRNA join 
(1821.. 1864, 3179..3309, 3722.3869, 4566..4813, 
5129..5265, 7362..8265) /product= H stratum 
corneum chymotryptic enzyme" 
/note="alternatively spliced"; mRNA join 
(1821..1864, 3179..3309, 3722.3869, 4566..4813, 
5129..5265, 7362..7991) /note="alternatively 
spliced" 


Kallikrein 8 


KLK8 


BAA28673, AB009849 (CDS 35..817), 
AF095743 (CDS join 1035..1104, 1619..1778, 
1944..2206, 4304..4437, 5974..6129, mRNA 
500..670, 1027..1104, 1619..1778, 1944..2206, 
4304..4437, 5974..6174), AB010780 (CDS join 
1.39, 418..712, 878..>946), AF055982 


Kallikrein 10 


KLK10 


AAC14266, AF055481 (CDS join 614..701, 
2455..2635, 3589.3863, 4195..4328, 4793.4945, 
mRNA join 48.. 120, 605..701, 2455..2635, 
3589.3863, 4195..4328, 4793..S474), 
NM_002776 (CDS 220.. 1050) 


Kallikrein 11 


KLK11 


BAA33404, AAD47815, AB012917 (CDS 
26..874), AF164623 (CDS 4224..4263, 
5061..5217, 5545..5810, 6627..6763, 7158..7310) 
(mRNA join (2313..2398, 4189..4263, 
5061..5217,5545..5810,6627..6763,7158..7622) 
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Table 2 

Descriptive statistics for hk5, hk6, hk7, hk8, hklO and hkll serum protein levels in controls and 

patients with ovarian cancer 



hk5 (ng/ml) 
Non cancer (N=194) 
Cancer (N=3 9) 
% Increase** 



Mean Standard Error Median Range p value* 

0.063 0.029 0.00 0.00-4.50 

0.48 0.18 0.00 0.00-5.70 

661% - - <0.001 



hk6(ng/mn 

Non cancer (N=l 94) 6.96 

Cancer (N=39) 9.88 

% Increase**' 42% 



0.18 
2.20 



6.60 
6.60 



1.60-15.30 
1.50-70.80 



0.91 



hk7fng/ml) 

Non cancer (N=194) 2.60 

Cancer (N=39) 2.49 

% Decrease** 4% 



0.071 
0.41 



2.67 
1.80 
33% 



0.30-6.00 
0.00-10.80 



0.007 



hk8(ng/rol) 
Non cancer (N=194) 
Cancer (N=39) 
% Decrease** 



11.74 
11.91 



0.27 
1.88 



11.70 
6.90 
41% 



2.40-22.20 
0.00-46.20 



0.005 



hK10(ng/mn 

Non cancer (N=194) 1 . 16 

Cancer (N=39) 6.51 

% Increase** 461% 



0.051 
2.46 



1.08 
1.59 
40% 



0.00-4.20 
0.27-90.0 



O.001 



hKll(ng/mn 

Non cancer (N=194) 0.21 

Cancer (N=39) 0.79 

% Increase** 276% 



0.018 
0.21 



0.12 
0.18 
50% 



00-1.30 
0.00-5.52 



0.011 



10 



* Calculated by the Mann Whitney test 

** Calculated by assuming that value in non-cancerouos tissue is 100% 
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Table3 

Correlations between the studied variables in 194 non-cancer cases 



variable 



hK5 



hK6 



hK7 



hK3 hK10 hKll CA125 



hK5 



hK6 



hK7 



hK8 



hKlO 



hKll 



CA125 



r 8 1.000 



0.034 
0.642 
-0.053 
0.462 
0.066 
0.359 
0.134 
0.062 
0.150 
0.037 
0.101 
0.172 



P 
P 



0.034 -0.053 0.066 0.134 0.150 0.101 
0.642 0.462 0.359 0.062 0.037 0.172 



1.000 

0.114 
0.115 
0.298 
0.000 
0.191 



0.114 0.298 0.191 0.120 



1.000 0.497 0.321 



0.000 0.000 



0.497 1.000 0.263 



0.000 



-0.160 0.135 0.048 



0.000 



0.321 0.263 1.000 



0.008 0.000 0.000 

0.120 0.399 0.396 0.176 

0.097 0.000 0.000 0.014 



0.035 



0.029 0.066 0.519 0.638 



0.399 
0.000 
0.396 
0.000 
0.176 
0.014 
1.000 

0.125 
0.090 



-0.160 



0.115 0.000 0.008 0.097 0.029 



0.135 
0.066 
0.048 
0.519 
0.035 
0.638 
0.125 
0.090 
1.000 
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Table 4 

Correlations between the studied variables in 39 ovarian cancer cases 



variable 



hK5 



hK6 



iiK7 



hK8 hK10 hKll CA125 



hK5 



hK6 



hK7 



hK8 



hK10 



hKll 



CA125 



1.000 

0.475 
0.002 
0.553 
0.000 
0.554 
0.000 
0.618 
0.000 
0.584 
0.000 
0.507 
0.001 



0.475 
0.002 
1.000 

0.327 
0.042 



0.003 



0.553 0.554 0.618 0.584 

0.000 0.000 0.000 0.000 

0.327 0.513 0.470 0.661 

0.042 0.001 0.003 0.000 

1.000 0.695 0.690 0.748 



0.001 0.000 



0.530 0.262 



0.000 0.000 



0.513 0.695 1.000 0.602 



0.470 0.690 0.602 LOOO 



0.000 0.000 



0.000 0.000 0.000 



0.443 



0.000 



0.783 



0.000 0.000 



0.706 



0.000 



0.000 



0.548 0.556 



0.507 
0.001 
0.530 
0.001 
0.262 
0.107 
0.443 
0.005 
0.548 
0.000 



0.661 0.748 0.783 0.706 1.000 0.556 



0.000 



1.000 



0.001 0.107 0.005 0.000 0.000 
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We Claim ; 

1. A method for detecting a plurality of kallikrein markers associated with ovarian cancer in a patient 
comprising: 

5 (a) obtaining a sample from a patient; 

(b) detecting in the sample a plurality of kallikrein markers and optionally CA125, wherein the 
kallikrein markers comprise or are selected from the group consisting of kallikrein 5, 
kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and 

(c) comparing the detected amounts with amounts detected for a standard. 

10 2. A method for diagnosing and monitoring ovarian cancer in a subject comprising detecting in a 
sample from the subject a plurality of kallikrein markers, wherein the kallikrein markers comprise 
or are selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, 
kallikrein 10, and kallikrein. 

3. A method as claimed in claim 1 or 2 wherein the plurality of kallikrein markers are detected using 
1 5 antibodies that bind to each of the plurality of kallikrein markers or parts thereof 

4. A method as claimed in claim 1, 2, 3 which further comprises detecting CA125. 

5. A method of detecting ovarian cancer in a patient, the method comprising comparing: 

(a) levels of a plurality of kallikrein markers, and optionally CA125, in a sample from the 
patient, wherein the kallikrein markers comprise or are selected from the group consisting 

20 of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and 

(b) normal levels of expression of the plurality of kallikrein markers, and optionally CA 1 25, in 
a control sample, wherein a significant difference in levels of kallikrein markers and 
optionally CA125, relative to the corresponding normal levels, is indicative of ovarian 
cancer. 

25 6. A method for monitoring the progression of ovarian cancer in a patient, the method comprising: (a) 
detecting in a sample from the patient at a first time point, a plurality of kallikrein markers, wherein 
the kallikrein markers comprise or are selected from the group consisting of kallikrein 5, kallikrein 
6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; (b) repeating step (a) at a subsequent 
point in time; and (c) comparing levels detected in steps (a) and (b), and thereby monitoring the 

30 progression of ovarian cancer. 

7. A method for determining in a patient whether ovarian cancer has metastasized or is likely to 
metastasize in the future, the method comprising comparing (a) levels of a plurality of kallikrein 
markers, and optionally CA125, in a patient sample, wherein the kallikrein markers comprise or are 
selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 

35 10, and kallikrein 1 1; and (b) normal levels or non-metastatic levels of the kallikrein markers and 

optionally CA125, in a control sample wherein a significant difference between the levels of 
expression in the patient sample and the normal levels or non-metastatic levels is an indication that 
the ovarian cancer has metastasized. 

8. A method for assessing the aggressiveness or indolence of ovarian cancer comprising comparing: 



WO 2004/075713 PCT/CA2004/000281 



-43- 

(a) levels of expression of a plurality of kallikrein markers, and optionally CA 125, in a patient 
sample, wherein the kallikrein markers comprise or are selected from the group consisting of 
kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and (b) normal 
levels of expression of the plurality of markers and optionally CA125, in a control sample, wherein 
5 a significant difference between the levels in the patient sample and normal levels is an indication 

that the cancer is aggressive or indolent. 

9. A method for diagnosing and monitoring ovarian cancer in a sample from a subject comprising 
isolating nucleic acids from the sample; and detecting in the sample polynucleotides encoding a 
plurality of kallikrein markers, and optionally CA125, wherein the kallikrein markers comprise or 

10 are selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, 

kallikrein 10, and kallikrein. 

10. A method as claimed in claim 9 wherein significant differences in the levels of the polynucleotides 
in the sample compared to a control is indicative of disease, disease stage, and/or prognosis. 

11. A method for determining the presence or absence of ovarian cancer in a subject comprising: (a) 
15 contacting a sample obtained from the subject with oligonucleotides that hybridize to 

polynucleotides encoding kallikrein markers, and optionally CA125, wherein the kallikrein markers 
comprise or are selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, 
kallikrein 8, kallikrein 10, and kallikrein 11; and (b) detecting in the sample a level of nucleic acids 
in the sample that hybridize to the polynucleotides relative to a predetermined cut-off value, and 
20 therefrom determining the presence or absence of ovarian cancer in the subject. 

12. A method as claimed in claim 11, wherein the nucleic acids are mRNA and the levels of nucleic 
acids are detected by polymerase chain reaction. 

13. A method as claimed in claim 1 1 wherein the nucleic acids are mRNA and the amounts of mRNA 
are detected using a hybridization technique, employing oligonucleotide probes that hybridize to 

25 kallikrein markers, and optionally CA 125. 

14. A method for assessing the potential efficacy of a test agent for inhibiting ovarian cancer in a 
patient, the method comprising comparing: (a) levels of a plurality of kallikrein markers, optionally 
CA125, and/or polynucleotides encoding same, in a first sample obtained from a patient and 
exposed to the test agent, wherein the kallikrein markers comprise or are selected from the group 

30 consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 1 1, and 

(b) levels of the plurality of kallikrein markers, optionally CA125, and/or polynucleotides encoding 
same, in a second sample obtained from the patient, wherein the sample is not exposed to the test 
agent, wherein a significant difference in the levels of expression of the plurality of kallikrein 
markers, optionally CA125, and/or polynucleotides encoding same, in the first sample, relative to 

35 the second sample, is an indication that the test agent is potentially efficacious for inhibiting ovarian 

cancer in the patient 

15. A method of claim 14 wherein the first and second samples are portions of a single sample obtained 
from the patient. 

16. A method of claim 14 wherein the first and second samples are portions of pooled samples obtained 
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firom the patient. 

17. A method of assessing the efficacy of a therapy for inhibiting ovarian cancer in a patient, the 
method comprising comparing: (a) levels of a plurality of kallikrein markers, optionally CA125, 
and/or polynucleotides encoding same, in a first sample obtained from the patient, wherein the 

5 kallikrein markers comprise or are selected from the group consisting of kallikrein 5, kallikrein 6, 

kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11, and (b) levels of the kallikrein markers, 
optionally CA125, and/or polynucleotides encoding same, in a second sample obtained from the 
patient following therapy, wherein a significant difference in the levels of expression of the 
kallikrein markers, optionally CA125, and/or polynucleotides encoding same, in the second sample, 

10 relative to the first sample, is an indication that the therapy is efficacious for inhibiting ovarian 

cancer in the patient. 

18. A method of selecting an agent for inhibiting ovarian cancer in a patient the method comprising (a) 
obtaining a sample comprising cancer cells from the patient; (b) separately exposing aliquots of the 
sample in the presence of a plurality of test agents; (c) comparing levels of a plurality of kallikrein 

15 markers, optionally CA125, and/or polynucleotides encoding same, in each of the aliquots, wherein 

the kallikrein markers comprise or are selected from the group consisting of kallikrein 5, kallikrein 
6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and (d) selecting one of the test agents 
which alters the levels of kallikrein markers, optionally CA125, and/or polynucleotides encoding 
same, in the aliquot containing that test agent, relative to other test agents. 

20 19. A method of inhibiting ovarian cancer in a patient, the method comprising (a) obtaining a sample 
comprising cancer cells from the patient; (b) separately maintaining aliquots of the sample in the 
presence of a plurality of test agents; (c) comparing levels of a plurality of kallikrein markers, 
optionally CA125, and/or polynucleotides encoding same, in each of the aliquots, wherein the 
kallikrein markers comprise or are selected from the group consisting of kallikrein 5, kallikrein 6, 

25 kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; and (d) administering to the patient at 

least one of the test agents which alters the levels of kallikrein markers, optionally CA125, and/or 
polynucleotides encoding same, in the aliquot containing that test agent, relative to other test agents. 

20. A method of assessing the ovarian cell carcinogenic potential of a test compound, the method 
comprising: (a) maintaining separate aliquots of ovarian cells in the presence and absence of the test 

30 compound; and (b) comparing expression of a plurality of markers, optionally CA125, and/or 

polynucleotides encoding same, in each of the aliquots, wherein the markers comprise or are 
selected from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 
10, and kallikrein 1 1, and wherein a significant difference in levels of kallikrein markers, optionally 
CA125, and/or polynucleotides encoding same, in the aliquot maintained in the presence of the test 

35 compound, relative to the aliquot maintained in the absence of the test compound, is an indication 

that the test compound possesses ovarian cell carcinogenic potential. 

21. A method of inhibiting ovarian cancer in a patient at risk for developing ovarian cancer, the method 
comprising inhibiting expression of genes encoding kallikrein markers and optionally CA125, 
wherein the kallikrein markers comprise or are selected from the group consisting of kallikrein 5, 
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kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 1 1. 

22. A method of any preceding claim wherein the plurality comprises at least three of the markers. 

23. A method of any preceding claim wherein the plurality comprises at least five of the markers. 

24. A method of any preceding claim wherein the plurality of kallikrein markers is selected from the 
5 group consisting of kallikrein 5, kallikrein 7, and kallikrein 8; kallikrein 5, kallikrein 8, and 

kallikrein 10; kallikrein 7, kallikrein 8, and kallikrein 10; kallikrein 5, kallikrein 7, kallikrein 8, and 
kallikrein 10; kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 11; kallikrein 5, kallikrein 7, 
kallikrein 8, kallikrein 10, and kallikrein 11; or kallilkrein 5, kallikrein 6, kallikrein 7, kallikrein 8, 
kallikrein 10 and kallikrein 11. 
10 25. A method of any proceeding claims wherein the kallikrein markers are kallikrein 7, kallikrein 8, 
kallikrein 1 0 and kallirkein 11. 

26. A method of any preceding claim wherein the patient sample comprises serum obtained from the 
patient 

27. A kit for carrying out a method as claimed in any preceding claim. 

15 28. A kit for assessing whether a patient is afflicted with ovarian cancer, the kit comprising reagents 
that specifically bind with a plurality of kallikrein markers and optionally CA125, wherein the 
kallikrein markers comprise or are selected from the group consisting of kallikrein 5, kallikrein 6, 
kallikrein 7, kallikrein 8, kallikrein 10, and kallikrein 1 1. 

29. A kit for assessing the suitability of each of a plurality of agents for inhibiting ovarian cancer in a 
20 patient, the kit comprising: (a) the plurality of agents; and (b) reagents for detecting a plurality of 

kallikrein markers and optionally CA125, wherein the kallikrein markers comprise or are selected 
from the group consisting of kallikrein 5, kallikrein 6, kallikrein 7, kallikrein 8, kallikrein 10, and 
kallikrein 11. 

30. A kit as claimed in claim 28 or 29 wherein the reagents are antibodies that specifically bind with 
25 protein or protein fragments corresponding to kallikrein markers and optionally CA125. 
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Figure 3 
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Figure 5 
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Figure 6 
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Figure 7 
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Sequence Listing 



SEQ ID NO. 1 

CA125 amino acid 

5 1 ralkpsglpgs ssptrslmtg srstkatpera dsgltgatls pktstgaiw tehtlpftsp 

61 dktlasptss wgrttqslg vmssalpest srgmthseqr tspslspqvn gtpsmypat 

121 sravsglsspr trtsstegnf tkeastytlt vettsgpvte kytvptetst tegdstetpw 

181 dtryipvkit spmktfadst askenapvsm tpaettvtds htpgrtnpsf gtlyssfldl 

241 spkgtpnsrg etalelilst tgypfsspep gsaghsrist saplsssasv ldnkisetsi 

10 301 fsgqsltspl spgvpearas tmpnsaipfs mtlsnaetsa ervrstissl gtpsistkqt 

361 aetiltfhaf aetmdipsth iaktlasewl gspgtlggts tsaltttsps ttlvseetnt 

421 hkstBgkete gtlntsmtpl etsapgeese mtatlvptlg fttldskirs psqvssshpt 

481 relrttgsts grqssstaah gssdilratt sstskasswt sestaqqfse pqhtqwvets 

541 psmkterppa stsvaapitt svpswsgft tlktsstkgi wleetsadtl igestagptt 

15 601 hqfavptgis ratggsstrgs qgtthlltra tassetsadl tlatngvpvs vspavsktaa 

661 gssppggtkp sytravssvip etsslqssaf regtslgltp lntrhpfssp epdsaghtki 

721 atsipllssa svledkvsat stfshhkats sittgtpeis tktkpssavl ssratlsnaat 

781 8pervrnats plthpspsge etagsvltle tBaettdspn ihptgtltse ssespstlsl 

841 psvsgvkttf ssstpsthlf tsgeeteets npsvsqpets vsrvrttlas tsvptpvfpt 

20 901 radtwptrsaq fssshlvsel ratsstsvtn stgsalpkis hltgtatmsq tnrdtfndsa 

961 apqsttwpet sprfktglps atttvstsat slsatvmvsk ftspatssme atsirepstt 

1021 ilttettngp gsmavastni pigkgyiteg rldtshlpig ttassetsmd ftraakeBvsra 

1081 svspsqsmda agsstpgrts qfvdtfsddv yhltsreiti prdgtssalt pqratathpps 

1141 pdpgsarstw lgilssspss ptpkvtmsst fstqrvttsm imdtvetsrw ntnpnlpstts 

25 1201 ltpsniptsg aigkstlvpl dtpspatsle asegglptls typestntps ihlgahasse 

1261 spstikltma swkpgsytp ltfpsiethi hvstarmays sgssperatap getntgstwd 

1321 pttyitttdp kdtSBaqvst phBvrtlrtt enhpktesat paaysgspki ssspnltspa 

1381 tkawtitdtt ehstqlhytk laekssgfet qsapgpvsw iptsptigss tleltsdvpg 

1441 eplvlapseq ttitlpmatw latslteema stdldissps sprastfaifp pmstpshels 

30 1501 kseadtsair ntdsttldqh lgirslgrtg dlttvpitpl tttwtsvieh stqaqdtlsa 

1561 tmspthvtqs lkdqtsipas aspshltevy pelgtqgrss seattfwkps tdtlsreiet 

1621 gptniqstpp mdntttgsss sgvtlgiahl pigtsspaet stnmalerrs statvsmagt 

1681 mgllvtsapg rsisqslgrv ssvlsestte gvtdsskgss prlntqgnta lssslepsya 

1741 egsqmstsip ltsspttpdv ef iggstfwt kevttvmtsd iskssartes ssatlmstal 

35 1801 gstentgkek lrtasmdlps ptpsmevtpw isltlsnapn ttdsldlshg vhtssagtla 

1861 tdrslntgvt rasrlengsd tsskslsmgn sthtsmtdte ksevsssihp rpetsapgae 

1921 ttltstpgnr aisltlpfss ipveevistg itsgpdinsa pmthspitpp tivwtstgtl 

1981 eqstqplhav ssekvsvqtq stpyvnsvav saspthensv BBgBstsspy ssasleslds 

2041 tisrrnaits wlwdlttslp tttwpstsls ealssghsgv snpsstttef plfsaastsa 

40 2101 akqrnpetet hgpqntaast lntdassvtg lsetpvgasi ssevplpmai tsrsdvsglt 

2161 sestanpslg tassagtklt rtislptses lvsfrrankdp wtvBiplgsb pttntetsip 

2221 vnsagppgls tvasdvidtp sdgaesiptv sfspspdtev ttishfpekt thsfrtissl 

22 81 theltsrvtp ipgdwmssam stkptgasps itlgerxtit saapttspiv ltasftetst 

2341 vsldnettvk tsdildarkt nelpsdssss sdlintsias stmdvtktas isptsisgmt 

45 2401 assspslfss drpqvptstt etntatBpsv ssntysldgg snvggtpatl ppftithpve 

2461 tssallawsr pvrtfstnrvs tdtasgenpt ssnswtsvp apgtwasvgs ttdlparagfl 

2521 ktspageahs llastiepat aftphlsaav vtgssatsea Bllttseska ihsspqtptt 

2581 ptsganwets atpesllwt etsdttltsk ilvtdtilfs tvstppskfp stgtlBgasf 

2641 ptllpdtpai pltateptss latsfdstpl vtiasdslgt vpettltmse tsngdalvlk 

50 2701 tvsnpdrsip gitiqgvtes plhpsstsps kivaprntty egsitvalst lpagttgslv 

2761 fsqgsenset talvdssagl erasvmpltt gsqgraassgg irsgsthstg tktfsslplt 

2821 ranpgevtams eittnrltat qstapkgipv kptsaesgll tpvsasssps kafaslttap 

2881 pstwgipqst ltfefsevps Idtksaslpt pgqslntipd sdastasssl skspeknpra 

2941 rmmtstkais assfqstgft etpegsasps tnagheprvpt sgtgdpryas esmsypdpsk 

55 3001 assaratstsl asklttlfst gqaarsgsss spislsteke tsflsptast srktslflgp 

3061 smarqpnilv hlqtsaltls ptstlnrasqe eppeltSBqt iaeeegttae tqtltftpse 

3121 tptsllpvBS pteptarrks spetwassis vpaktslvet tdgtlvttik mssqaaqgns 

3181 twpapaeetg tspagtspgs pevsttlkim sskepsispe irstvraspw ktpettvpme 

3241 ttvepvtlqs talgsgstsi shlptgttsp tksptenmla tervslspsp peawtnlysg 
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3301 tpggtrqsla tmssvslesp tarsitgtgq qsspelvskt tgraefsmwhg stggttgdth 
3361 vslstssnil edpvtspnsv ssltdkskhk tetwvsttai pstvlnnkim aaeqqtsrsv 
3421 deaysstssw sdqtsgsdit lgaspdvtnt lyitstaqtt slvslpsgdq gitsltnpsg 
3481 gktssassvt spsigletlr anvsavksdi aptaghlsqt sspaevsild vttaptpgis 
5 3541 ttittmgtns istttpnpev ginstmdstpa terrttsteh pstwsstaas dswtvtdmts 

3601 nlkvarspgt istmhttsfl assteldsms tphgritvig tslvtpssda savktetets 
3661 ertlspsdtt astpistfsr vqrmsisvpd ilstswtpss teaedvpvsm vstdhastkt 
3721 dpntplstfl fdslstldwd tgrslssata ttsapqgatt pqeltletmi spatsqlpfs 
3781 ighitsavtp aamarssgvt fsrpdptskk aeqtstqlpt ttsahpgqvp rsaattldvi 

10 3841 phtaktpdat fqrqgqtalt tearatsdsw nekekstpsa pwitemmnsv sedtikevts 

3901 sssvlkdpey aghklgiwdd fipkfgkaah mrelpllspp qdkeaihpst ntvettgwvt 
3961 ssehashsti pahsassklt spwttstre qaivsrasttt wpestrarte pnsfltielr 
4021 dvspyradtSB ttqtsiissp gstaitkgpr teitsskris ssflaqsmrs sdspseaitr 
4081 lsnfparates ggmilatnqts ppgatslsap tldtsatasw tgtplattqr ftysekttlf 

15 4141 skgpedtsqp sppsveetsB Bsslvpibat tspsnillts qghspsstpp vtBvflsets 

4201 glgkttdmsr islepgtslp pnlsstagea lstyeasrdt kaihhsadta vtnmeatsse 
4261 yspipghtkp skatsplvts hiragditsst svfgssette ietvssvnqg lqerstsqva 
4321 ssatetstvi thvssgdatt hvtktqatfs sgtsissphq fitstntftd vBtnpstsli 
4381 mtessgvtit tqtgptgaat qgpylldtst rapyltetpla vtpdfmqsek ttliskgpkd 

20 4441 vtwtsppsva etsypssltp flvttippat stlqgqhtss pvsatsvlts glvkttdmln 

4501 tsraepvtnsp qnlnnpsnei latlaattdi etihpsinka vtnragtassa hvlhstlpva 
4561 sepstatspra vpassmgdal asisipgset tdiegeptss ltagrkenst lqeransttes 
4621 niilsnvsvg aiteatkraev psfdatfipt paqstkfpdi fsvassrlsn sppmtisthra 
4681 tttqtgssga tskiplaldt Btletsagtp swtegfaiiB kittamrmdv kdvsqtnppf 

25 4741 qdeasspssq apvlvttlps svaftpqwhs tsspvsmssv ltsslvktag kvdtsletvt 

4801 sspqsmsntl ddisvtsaat tdietthpsi ntwtnvgtt gBafeshstv saypepskvt 
4861 spnvttBtme dttisrsipk sskttrtete ttssltpklr etBisqeits stetstvpyk 
4921 eltgattevs rtdvtsssst sfpgpdqstv sldistetnt rlstspirate saeitittqt 
4981 gphgatsqdt ftradpsnttp qagihsamth gfsqldvttl msripqdvsw tsppsvdkts 

30 5041 spssflsspa rattpslisst Ipedklsspm tslltsglvk itdilrtrle pvtsslpnf s 

5101 stsdkilats kdskdtkeif psinteetnv kannsghesh epaladsetp kattqmvitt 
5161 tvgdpapsts rapvhgssett nlkreptyfl tprlretsts qessfptdts fllskvptgt 
5221 itevsstgvn ssskistpdh dkstvppdtf tgeiprvfts siktksaerat ittqasppes 
5281 ashstlpldt sttlsqggth stvtqgfpys evttlmgmgp gnvswmttpp veetsBvssl 

35 5341 msspamtsps pvsstspqsi pssplpvtal ptsvlvtttd vlgttspesv tssppnlssi 

5401 therpatykd tahteaamhh stntavtnvg tsgsghksqs svladsetsk atplmsttst 
5461 lgdtsvstst pnisqtnqiq teptaslspr lresstsekt ssttetntaf syvptgaitq 
5521 asrteisssr tsisdldrpt iapdistgmi trlftspimt ksaemtvttq tttpgatsqg 
5581 ilpwdtsttl fqggthstvs qgfphseitt lrsrtpgdvs wrattppveet ssgfalmsps 

40 5641 mtspspvsst spesipsspl pvtalltsvl vtttnvlgtt spetvtsBpp nlsBptqerl 

5701 ttykdtahte amhasmhtiit avanvgtsis ghesqssvpa dshtskatsp mgitfamgdt 
5761 svststpaff etriqtests slipglrdtr tseeintvte tstvlsevpt ttttevsrte 
5821 vitssrttis gpdhskraspy iatetitrls tfpfvtgste maitnqtgpi gtisqatltl 
5881 dtSBtasweg thspvtqrfp hseetttrasr stkgvswqsp psveetssps spvplpaits 

45 5941 hsslysavsg ssptsalpvt slltsgrrkt idmldthsel vtsslpsass fsgeiltsea 

6001 stntetihfs entaetnmgt tnsmhklhss vsihsqpsgh tppkvtgsrara edaivststp 
6061 gspetknvdr dstspltpel kedstalvmn stteBiitvfs svsldaatev sraevtyydp 
6121 tfmpasaqst kspdispeas sshsnspplt isthktiatq tgpsgvtslg qltldtstia 
6181 tsagtpsart qdfvdsetts vmnndlndvl ktspfsaeea nslssqapll vttspspvts 

50 6241 tlqehstSBl vsvtsvptpt lakitdmdtn lepvtrspqn Irntlatsea ttdthtmhps 

6301 intamanvgt tsspnefyft vspdsdpyka tsawitsts gdsivststtip rssamkkies 
6361 ettfslifrl retstsqkig ssedtstvfd kaftaattev srteltsssr tsiqgtekpt 
6421 raspdtstrsv tmlstfaglt kseertiatq tgphratsqg tltwdtsitt sqagthsamt 
6481 hgfsqldlst ltsrvpeyis gtsppsvekt BSBssllslp aitspspvpt tlpesrpssp 

55 6541 vhltslptsg lvkttdmlas vaslppnlgs tshkipttse dikdtekmyp stniavtavg 

6601 tttsekesys svpayseppk vtspravtsfn irdtivstsm pgsBeitrie mestf svahg 
6661 lkgtstsqdp ivsteksavl hklttgatet Brtevassrr tsipgpdhst espdistevi 
6721 pslpislgit essnmtiitr tgpplgstsq gtftldtptt ssragthsma tqefphserat 
6781 tvmnkdpeil swtippsiek tsfssslmps pamtsppvss tlpktihttp spmtslltpB 

60 6841 lvmttdtlgt spepttsspp nlsstshvil ttdedttaie arahpststaa tnvettcsgh 

6901 gsqssvltds ektkatapmd ttstmghttv stsmsvsset tkikrestys ltpglretsi 
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6961 sqnasfstdt sivlsevptg ttaevsrtev tssgrtsipg psqstvlpei strtmtrlfa 

7021 sptmtesaem tiptqtgpsg stsqdtltld tsttksqakt hstltqrfph semttlmsrg 

70 Bl pgdmswqssp slenpsslps llslpattsp ppisstlpvt isssplpvts lltsspvttt 

7141 dralhtspelv tssppklsht sderlttgkd ttnteavhps tntaasnvei psfghespss 

5 7201 aladsetska tspmfitstq edttvaistp hfletsriqk esisslspkl retgssvets 

7261 saietsavls evsigattei srtevtsssr tsisgBaest ralpeisttrk iikfptspil 

7321 aesseratikt qtsppgstse stftldtstt pslvithstm tqrlphseit tlvsrgagdv 

7381 prpsslpvee tsppssqlsl samispspvs stlpasshss sasvtspltp gqvkttevld 

7441 asaepetssp pslsstsvei latsevttdt ekihpfpnta vtkvgtsssg hespssvlpd 

10 7501 settkatsara gtisimgdts vstltpalsn trkiqsepas slttrlrets tseetslate 

7561 antvlskvst gattevsrte aisfsrtsms gpeqstmsqd isigtipris assvltesak 

7621 mtittqtgps estlestlnl ntattpswve thsiviqgfp hpemttsmgr gpggvswpsp 

7681 pfvketspps splslpavts phpvsttfla hippsplpvt slltsgpatt tdilgtstep 

7741 gtssssslst tsherlttyk dtahteavhp stntggtnva ttssgyksqs svladssprac 

15 7801 ttstmgdtsv Itstpaflet rriqtelass ltpglressg segtssgtkm stvlskvptg 

7861 atteiskedv tsipgpaqst ispdistrtv swfstspvmt esaeitmnth tsplgattqg 

7921 tstlatsstt sltmthstis qgfshsqrast lmrrgpedvs wmsppllekt rpsfslmssp 

7981 attspspvss tlpeaissBp lpvtslltsg lakttdmlhk ssepvtnspa nlsstsveil 

8041 atsevttdte kthpssnrtv tdvgtsssgh estsfvlads qtskvtsprav itstraedtsv 

20 8101 ststpgffet sriqteptss ltlglrktss segtslatem stvlsgvptg ataevsrtev 

8161 tsssrtsisg faqltvspet stetitrlpt sslmtesaem miktqtdppg stpesthtvd 

8221 isttpnwvet hstvtqrfBh semttlvsrs pgdmlwpsqs sveetssass llslpattsp 

8281 spvsstlved fpsaslpvts lltpglvitt drmgisrepg tsstsnlsst sherlttled 

8341 tvdtedroqps thtavtnvxt sisghesqss vlsdsetpka tspmgttytm getsvsists 

25 8401 dffetsriqi eptssltsgl retssseris sategstvls evpsgattev srtevisBrg 

8461 tsmsgpdqft ispdisteai trlstspimt esaesaitie tgspgatseg tltldtsttt 

8521 fwsgthstas pgfshsemtt lmsrtpgdvp wpBlpsveea ssvssslssp amtstsffsa 

8581 lpesisssph pvtalltlgp vkttdmlrts sepetssppn lsstsaeila tsevtkdrek 

8641 ihpssntpw nvgtviykhl spssvladlv ttkptspmat tstlgntsvs tstpafpetm 

30 8701 mtqptsslts glreistsqe tssatersas lsgmptgatt kvsrtealsl grtstpgpaq 

8761 stispeiste titristplt ttgsaemtit pktghsgass qgtftldtss raswpgthsa 

8821 athrsphsgm ttpmsrgped vswpsrpsve ktsppsslvs lsavtspspl yBtpsesshs 

8881 splrvtslft pvmmkttdml dtslepvtts ppstnnitsde slatskatrae teaiqlsent 

8941 avtqragtisa rqefyssypg Ipepskvtsp wtsstikdi vsttipasse itriemests 

35 9001 tltptprets tsqeihsatk pstvpykalt satiedsmtq vmsssrgpsp dqstmsqdis 

9061 sevitrlsts pikaestemt ittqtgspga tsrgtltldt sttfrasgths tasqgfshsq 

9121 mtalmsrtpg dvpwlshpsv eeassasfsl sspvmtsssp vsstlpdsih ssslpvtsll 

9181 tsglvkttel lgtssepets sppnlsstsa eilattevtt dteklemtnv vtsgythesp 

9241 ssvladsvtt katssmgity ptgdtnvlts tpafsdtsri qtksklsltp glmetsisee 

40 93 01 tssatekstv lssvptgatt evsrteaiss srtsipgpaq stmssdtsrae titristplt 

9361 rkestdmait pktgpsgats qgtftldsss taswpgthsa ttqrfpqsw ttpmsrgped 

9421 vswpsplsve knsppsslvs sssvtspspl ystpsgsshs spvpvtslft siramkatdml 

9481 daslepetts apnranitsde slatskatte teaihvfent aashvettsa teelyssspg 

9541 fseptkvisp wtsBsirdn ravsttmpgss gitrieiesm ssltpglret rtsqditsst 

45 9601 etstvlykms sgatpevsrt evmpssrtsi pgpaqstmsl disdewtrl stspimtesa 

9661 eitittqtgy slatsqvtlp Igtsratflsg thstrasqgls hsemtnlrasr gpeslswtsp 

9721 rfvettrsss sltslpltts Ispvsstlld sspssplpvt slilpglvkt tevldtssep 

9781 ktssspnlss tsveipatse imtdtekihp ssntavakvr tsssvheshs svladsetti 

9841 tipsragitsa vddttvftsn pafsetrrip teptfsltpg fretstseet tsitetsavl 

50 9901 ygvptsatte vsrateimssn rthipdsdqs tmspdiitev itrlssssram sestqmtitt 

9961 qksspgataq stltlattta plarthstvp prflhsemtt lmsrspenps wksspfvekt 

10021 Bssssllslp vttspsvsst lpqsipsssf svtslltpgm vkttdtstep gtslspnlsg 

10081 tsveilaase vttdtekihp sssmavtnvg ttssghelys svsihsepsk atypvgtpss 

10141 raaetsistsm panfettgfe aepfshltsg frktnrasldt ssvtptntps spgsthllqs 

55 10201 sktdftssak tsspdwppas qyteipvdii tpfnaspsit estgitsfpe srfttnsvtes 

10261 thhlstdllp saetistgtv mpslseamts fattgvprai sgsgspfsrt esgpgdatls 

10321 tiaeslpsst pvpfssstft ttdsstipal heitsssatp yrvdtslgte ssttegrlvm 

10381 vstldtssqp grtsstpild trmtesvelg tvtsayqvps lstrltrtdg imehitkipn 

10441 eaahrgtirp vkgpqtstsp aspkglhtgg tkrmetttta lkttttalkt tsratlttsv 

60 10501 ytptlgtltp lnasrqmast iltenunittp yvfpdvpett sslatslgae tstalprttp 

10561 svlnresett aslvsrsgae rspviqtldv sssepdttas wvihpaetip tvskttpnff 
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10621 hseldtvsst atshgadvss aiptnispse ldaltplvti sgtdtsttfp tltksphete 
10681 trttwlthpa etsstiprti pnfshhesda tpsiatspga etssaipimt vspgaedlvt 
10741 sqvtsegtdr nmtiptltls pgepktiasl vthpeaqtss aiptstispa vsrlvtsmvt 
10801 elaaktsttn raltnspgep attvslvthp aqtsptvpwt tsiffhsksd ttpsmttshg 
5 10861 aesssavptp tvstevpgw tplvtssrav isttipiltl spgepettpa matshgeeas 

10921 saiptptvsp gvpgwtslv tssravtstt ipiltfslge pettpsmats hgteagsavp 
10981 tvlpevpgmv tslvassrav tsttlptltl spgepettps matshgaeas stvptvapev 
11041 pgwtslvts Bsgvnstsip tlilspgele ttpsmatshg aeassavptp tvspgvsgw 
11101 tplvtssrav tsttipiltl sssepettps raatshgveaB savltvspev pgravtslvts 
10 11161 sravtsttip tltissdepe tttslvthse akmisaiptl avsptvqglv tslvtssgse 

11221 tsafsnltva Bsqpetidsw vahpgteass wptltvstg epftnislvt hpaessstlp 
11281 rttsrfshse Idtmpstvts peaesssais ttispgipgv ltslvtssgr disatfptvp 
11341 espheseata swvthpavts ttvprttpny shsepdttps iatspgaeat sdfptitvsp 
11401 dvpdmvtsqv tssgtdtsit iptltlssge petttsfity sethtssaip tlpvspgask 
15 11461 mltslvissg tdstttfptl tetpyepett aiqlihpaet ntravpkttpk fshBksdttl 

11521 pvaitspgpe assavsttti spdmsdlvts Ivpssgtdts ttfptlsetp yepettvtwl 
11581 thpaetsttv sgtipnfshr gsdtapsmvt spgvdtrsgv ptttippsip gwtsqvtss 
11641 atdtstaipt ltpspgepet tassathpgt qtgftvpirt vpssepdtma swvthppqts 
11701 tpvsrttssf shsspdatpv matsprteas savlttispg apentvtsqit ssgaatsttv 
11761 ptlthspgrap ettallsthp rtgtsktfpa stvfpqvset tasltirpga etstalptqt 
11821 tsslftllvt gtsrvdlspt aspgvsakta plsthpgtet stmiptstls lgllettgll 
11881 atsssaetst stltltvspa vsglssasit tdkpqtvtsw ntetspsvts vgppefsrtv 
11941 tgttmtlips emptppktsh gegvspttil rttnrveatnl attgssptva kttttfntla 
12 001 gslftplttp gmstlasesv tsrtsynhrs wisttssynr rywtpatstp vtstfspgis 
25 12061 tssipsstaa tvpfmvpftl nftitnlqye edmrhpgsrk fnaterelqg llkplfrnss 

12121 leylysgcrl aslrpekdss amavdaicth rpdpedlgld rerlywelsn ltngiqelgp 
12181 ytldrnslyv ngfthrssmp ttstpgtstv dvgtsgtpss spsptaagpl lrapftlnfti 
12241 tnlqyeedmr rtgsrkfntm esvlqgllkp lfkntsvgpl ysgcrltllr pekdgaatgv 
12301 daicthrldp kspglnreql ywelskltnd ieelgpytld rnslyvngft hqssvsttst 
30 12361 pgtstvdlrt sgtpsslssp timaagpllv pftlnftitn lqygedmghp gsrkfntter 

12421 vlqgllgpif kntsvgplys gcrltslrse kdgaatgvda icihhldpks pglnrerlyw 
12481 elsqltngik elgpytldrn slyvngfthr tsvpttstpg tstvdlgtsg tpfslpspat 
12541 agpllvlftl nftitnlkye edrahrpgsrk fnttervlqt llgpmfknts vgllysgcrl 
12601 tllrsekdga atgvdaicth rldpkspgld reqlywelsq ltngikelgp ytldrnslyv 
35 12661 ngfthwipvp tsBtpgtstv dlgsgtpBsl psptaagpll vpftlnftit nlqyeedmhh 

12721 pgsrkfntte rvlqgllgpm fkntsvglly sgcrltllrs ekdgaatgvd aicthrldpk 
12781 spgvdreqly welsqltngi kelgpytldr nslyvngfth qtsapntstp gtstvdlgts 
12841 gtpsslpspt sagpllvpft lnftitnlqy eedmrhpgsr kfnttervlq gllkplfkst 
12901 svgplysgcr ltllrsekdg aatgvdaict hrldpkspgv dreqlywels qltngikelg 
40 12961 pytldrnsly vngfthqtsa pntstpgtst vdlgtsgtps slpsptsagp llvpftlnf t 

13021 itnlqyeedm hhpgsrkfnt tervlqgllg pmfkntsvgl lysgcrltll rpekngaatg 
13081 mdaicshrld pkspglnreq lywelsqlth gikelgpytl drnslyvngf thrssvapts 
13141 tpgtstvdlg tsgtpsslps pttavpllvp ftlnftitnl qygedrarhpg srkfntterv 
13201 lqgllgplfk nssvgplysg crlislrsek dgaatgvdai cthhlnpqsp gldreqlywq 
45 13261 lsqmtngike lgpytldrns lyvngfthrs sglttstpwt stvdlgtsgt pspvpsptta 

13321 gpllvpftln ftitnlqyee dmhrpgsrkf nttervlqgl lspifknssv gplysgcrlt 
13381 slrpekdgaa tgmdavclyh pnpkrpgldr eqlywelsql thnitelgpy sldrdslyvn 
13441 gfthqnsvpt tstpgtstvy wattgtpssf pghtepgpll ipftfnftit nlhyeenmqh 
13501 pgsrkfntte rvlqgllkpl fkntsvgply sgcrltslrp ekdgaatgmd avclyhpnpk 
50 13561 rpgldreqly welsqlthni telgpysldr dslyvngfth qnsvpttstp gtstvywatt 

13621 gtpssfpght epgpllipft fnftitnlhy eenmqhpgsr kfnttervlq gllkplfknt 
13681 svgplysgcr ltllrpekhe aatgvdtict hrvdpigpgl drerlywels qltnsitelg 
13741 pytldrdsly vngfnprssv pttstpgtst vhlatsgtps slpghtapvp llipftlnft 
13801 itnlhyeenin qhpgsrkfnt tervlqgllk plfJcntsvgp lysgcrltll rpekheaatg 
55 13861 vdticthrvd pigpglxxex lywelsxltx xixelgpytl drxslyvngf thxxsxptts 

13921 tpgtstvxxg tsgtpssxpx xtsagpllvp ftlnftitnl qyeedmhhpg srkfntterv 
13981 Iqgllgpmfk ntsvgllysg crltllrpek ngaatgmdai cshrldpksp gldreqlywe 
14041 lsqlthgike lgpytldrns lyvngfthrs svaptstpgt stvdlgtsgt psslpsptta 
14101 vpllvpftln ftitnlqyge dmrhpgsrkf nttervlqgl lgplfknssv gplysgcrli 
60 14161 slrsekdgaa tgvdaicthh lnpqspgldr eqlywqlsqm tngikelgpy tldrnslyvn 

14221 gfthrssglt tstpwtstvd Igtsgtpspv pspttagpll vpftlnftit nlqyeedmhr 
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14281 pgsrkfnate rvlqgllspi fknssvgply sgcrltslrp ekdgaatgmd avclyhpnpk 
14341 rpgldreqly welsqlthni telgpysldr dslyvngfth qssmtttrtp dtstmhlats 
14401 rtpaslsgpt taspllvlft inctitnlqy eedmrrtgsr kfntmesvlq gllkplfknt 
14461 svgplysgcr ltllrpkkdg aatgvdaict hrldpkspgl nreqlywels kltndieelg 
5 14521 pytldmsly vngfthqssv sttstpgtst vdlrtsgtps slssptimxx xpllxpftxn 

14581 xtitnlxxxx xmxxpgsrkf nttervlqgl lrplfkntsv sslysgcrlt llrpekdgaa 
14641 trvdaactyr pdpkspgldr eqlywelsql thBitelgpy tldrvslyvn gfnprssvpt 
14701 tstpgtstvh latsgtpssl pghtxxxpll xpftxnxtit nlxxxxxmxx pgsrkfntte 
14761 rvlqgllkpl frnssleyly sgcrlaslrp ekdssamavd aicthrpdpe dlgldrerly 

10 14821 welsnltngi qelgpytldr nslyvngfth rssglttstp wtBtvdlgts gtpspvpspt 

14881 tagpllvpft Inftitnlqy eedmhrpgsr rfnttervlq glltplfknt svgplysgcr 
14941 ltllrpekqe aatgvdtict hrvdpigpgl drerlywels qltnsitelg pytldrdsly 
15001 vngfnpwssv pttstpgtst vhlatsgtps slpghtapvp llipftlnft itdlhyeenm 
15061 qhpgsrkfnt tervlqgllk plfkstsvgp lysgcrltll rpekhgaatg vdaictlrld 

15 15121 ptgpgldrer lywelsqltn svtelgpytl drdslyvngf thrssvptts ipgtsavhle 

15181 tsgtpaslpg htapgpllvp ftlnftitnl qyeedmrhpg srkfstterv Iqgllkplfk 
15241 ntsvsslysg crltllrpek dgaatrvdav cthrpdpksp gldrerlywk Isqlthgite 
15301 lgpytldrhs lyvngfthqs smtttrtpdt stmhlatsrt paslsgptta spllvlftin 
15361 ftitnlryee nmhhpgBrkf nttervlqgl lrpvfkntsv gplysgcrlt tlrpkkdgaa 

20 15421 tkvdaictyr pdpkspgldr eqlywelsql thsitelgpy tqdrdslyvn gfthrssvpt 

15481 tsipgtsavh letsgtpasl pghtapgpll vpftlnftit nlqyeedmrh pgsrkfntte 
15541 rvlqgllkpl fkstsvgply sgcrltllrp ekrgaatgvd ticthrldpl npgldreqly 
15601 welskltrgi ielgpylldr gslyvngfth rtsvpttstp gtstvdlgts gtpfslpspa 
15661 xxxpllxpf t xnxtitnlxx xxxmxxpgsr kfnttervlq tllgpmfknt svgllysgcr 

25 15721 ltllrsekdg aatgvdaict hrldpkspgv dreqlywels qltngikelg pytldrnsly 

15781 vngfthwipv ptsstpgtst vdlgsgtpss lpspttagpl lvpftlnfti tnlkyeedmh 
15841 cpgsrkfntt ervlqsllgp mfkntsvgpl ysgcrltllr sekdgaatgv daicthrldp 
15901 kspgvdreql ywelsqltng ikelgpytld mslyvngft hqtsapntst pgtstvdlgt 
15961 sgtpBslpsp txxxpllxpf txnxtitnlx xxxxmxxpgs rkfnttexvl qgllxpxfkn 

30 16021 xsvgxlysgc rltxlrxekx gaatgxdaic xhxxxpkxpg lxxexlywel sxltxxixel 

16081 gpytldrxsl yvngfthwip vptsstpgtB tvdlgsgtps slpspttagp llvpftlnft 
16141 itnlkyeedm hcpgsrkfnt tervlqsllg prafkntsvgp lysgcrltsl rsekdgaatg 
16201 vdaicthrvd pkspgvdreq lywelsqltn gikelgpytl drnslyvngf thqtsapnts 
16261 tpgtstvxxg tsgtpssxpx xtsagpllvp ftlnftitnl qyeedrahhpg srkfntterv 

35 16321 lqgllgpmfk ntsvgllysg crltllrpek ngattgmdai cthrldpksp glxxexlywe 

16381 lsxltxxixe lgpytldrxs lyvngfthxx sxpttstpgt stvxxgtsgt pssxpxxtxx 
16441 xpllxpftxn xtitnlxxxx xmxxpgsrkf nttervlqgl lkplfrnssl eylysgcrla 
16501 slrpekdssa mavdaicthr pdpedlgldr erlywelsnl tngiqelgpy tldrnslyvn 
16561 gfthrssmpt tstpgtstvd vgtsgtpsss pspttagpll ipftlnftit nlqygedmgh 

40 16621 pgsrkfntte rvlqgllgpi fkntsvgply sgcrltslrs ekdgaatgvd aicihhldpk 

16681 spglnrerly welsqltngi kelgpytldr nslyvngfth rtsvpttstp gtstvdlgts 
16741 gtpfslpspa tagpllvlft Inftitnlky eedmhrpgsr kfnttervlq tllgpmfknt 
16801 svgllysgcr ltllrsekdg aatgvdaict hrldpkspgl xxexlywels xltxxixelg 
16861 pytldrxsly vngfthxxsx pttstpgtst vxxgtsgtps sxpxxtxxxp llxpftxnxt 

45 16921 itnlxxxxxm xxpgsrkfnt tervlqgllr pvfkntsvgp lysgcrltll rpkkdgaatk 

16981 vdaictyrpd pkspgldreq lywelsqlth sitelgpytq drdslyvngf thrssvptts 
17041 ipgtsavhle ttgtpssfpg htepgpllip ftfnftitnl ryeenraqhpg srkfntterv 
17101 lqglltplfk ntsvgplysg crltllrpek qeaatgvdti cthrvdpigp gldrerlywe 
17161 lsqltnsite lgpytldrds lyvdgfnpws svpttstpgt stvhlatsgt psplpghtap 

50 17221 vpllipftln ftitdlhyee nmqhpgsrkf nttervlqgl lkplfkstsv gplysgcrlt 

17281 llrpekhgaa tgvdaictlr Idptgpgldr erlywelsql tnsitelgpy tldrdslyvn 
17341 gfnpwssvpt tstpgtstvh latsgtpssl pghttagpll vpftlnftit nlkyeedmhc 
174 01 pgsrkfntte rvlqslhgpm fkntsvgply sgcrltllrs ekdgaatgvd aicthrldpk 
17461 spglxxexly welsxltxxi xelgpytldr xslyvngfth xxsxpttstp gtstvxxgts 

55 17521 gtpssxpxxt xxxpllxpf t xnxtitnlxx xxxmxxpgsr kfnttexvlq gllxpxfknx 

17581 svgxlysgcr Itxlrxekxg aatgxdaicx hxxxpkxpgl xxexlywels xltnsitelg 
17641 pytldrdsly vngfthrssm pttsipgtsa vhletsgtpa slpghtapgp llvpftlnft 
17701 itnlqyeedm rhpgsrkfnt tervlqgllk plfkstsvgp lysgcrltll rpekrgaatg 
17761 vdticthrld plnpglxxex lywelsxltx xixelgpytl drxslyvngf thxxsxptts 

60 17821 tpgtstvxxg tsgtpssxpx xtxxxpllxp ftxnxtitnl xxxxxmxxpg srkfnttexv 

17881 lqgllxpxfk nxsvgxlysg crltxlrxek xgaatgxdai cxhxxxpkxp glxxexlywe 
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17941 lsxltxxixe lgpytldrxs lyvngfhprs svpttstpgt etvhlatagt psslpghtap 
18001 vpllipftln ftitnlhyee nmqhpgsrkf nttervlqgl lgpmfkntsv gllysgcrlt 
18061 llrpekngaa tgmdaicshr ldpkspglxx exlywelsxl txxixelgpy tldrxslyvn 
18121 gfthxxexpt tstpgtstvx xgtsgtpssx pxxtxxxpll xpftxnxtit nlxxxxxmxx 
5 18181 pgsrkfntte xvlqgllxpx fknxsvgxly sgcrltxlrx ekxgaatgxd aicxhxxxpk 

18241 xpglxxexly welsxltxxi xelgpytldr xslyvngfth qnsvpttstp gtstvywatt 
18301 gtpssfpght epgpllipft fnftitnlhy eenmqhpgsr Jcfnttervlq glltplfknt 
18361 svgplysgcr ltllrpekqe aatgvdtict hrvdpigpgl xxexlywels xltxxixelg 
18421 pytldrxsly vngfthxxsx pttetpgtst vxxgtsgtps sxpxxtxxxp llxpftxnxt 
10 18481 itnlxxxxxm xxpgsrkfnt texvlqgllx pxfknxsvgx lysgcrltxl rxekxgaatg 

18541 xdaicxhxxx pkxpglxxex lywelsxltx xixelgpytl drxslyvngf thrssvptts 
18601 spgtstvhla tsgtpsslpg htapvpllip ftlnftitnl hyeenmqhpg srkfntterv 
18661 lqgllkplfk stsvgplysg crltllrpek hgaatgvdal ctlrldptgp glxxexlywe 
18721 lsxltxxixe lgpytldrxs lyvngfthxx sxpttstpgt stvxxgtsgt pesxpxxtxx 
18781 xpllxpftxn xtitnlxxxx xraxxpgsrkf nttexvlqgl lxpxfknxsv gxlysgcrlt 
18841 xlrxekxgaa tgxdaicxhx xxpkxpglxx exlywelsxl txxixelgpy tldrxslyvn 
18901 gfthrtsvpt tstpgtstvh latsgtpssl pghtapvpll ipftlnftit nlqyeedmhr 
18961 pgsrkfntte rvlqgllspi fknssvgply sgcrltslrp ekdgaatgmd avclyhpnpk 
19021 rpgldreqly celsqlthni telgpysldr dslyvngfth qnsvpttstp gtstvywatt 
19081 gtpssfpght xxxpllxpft xnxtitnlxx xxxmxxpgsr kfnttexvlq gllxpxfknx 
19141 svgxlysgcr ltxlrxekxg aatgxdaicx hxxxpkxpgl xxexlywels xltxxixelg 
19201 pytldrxsly vngfthwssg lttstpwtst vdlgtsgtps pvpspttagp llvpftlnft 
19261 itnlqyeedra hrpgsrkfna tervlqglls pifkntsvgp lysgcrltll rpekqeaatg 
19321 vdticthrvd pigpglxxex lywelsxltx xixelgpytl drxslyvngf thxxsxptts 
25 19381 tpgtstvxxg tsgtpssxpx xtxxxpllxp ftxnxtitnl xxxxxmxxpg srkfnttexv 

19441 Iqgllxpxfk nxsvgxlysg crltxlrxek xgaatgxdai cxhxxxpkxp glxxexlywe 
19501 lsxltxxixe lgpytldrxs lyvngfthrs fglttstpwt stvdlgtsgt pspvpsptta 
19561 gpllvpftln ftitnlqyee dmhrpgsrkf nttervlqgl ltplfrntsv sslysgcrlt 
19621 llrpekdgaa trvdavcthr pdpkspglxx exlywelsxl txxixelgpy tldrxslyvn 
30 19681 gfthxxsxpt tstpgtstvx xgtsgtpssx pxxtxxxpll xpftxnxtit nlxxxxxmxx 

19741 pgsrkfntte xvlqgllxpx fknxsvgxly sgcrltxlrx ekxgaatgxd aicxhxxxpk 
19801 xpglxxexly welsxltxxi xelgpytldr xslyvngfth wipvptsstp gtstvdlgsg 
19861 tpsslpsptt agpllvpftl nftitnlqyg edmghpgsrk fnttervlqg llgpifknts 
19921 vgplysgcrl tslrsekdga atgvdaicih hldpkspglx xexlywelsx ltxxixelgp 
35 19981 ytldrxslyv ngfthxxsxp ttstpgtstv xxgtsgtpss xpxxtxxxpl lxpftxnxti 

20041 tnlxxxxxmx xpgsrkfntt exvlqgllxp xfknxsvgxl ysgcrltxlr xekxgaatgx 
20101 daicxhxxxp kxpglxxexl ywelsxltxx ixelgpytld rxslyvngft hqtfapntst 
20161 pgtstvdlgt sgtpsslpsp tsagpllvpf tlnftitnlq yeedmhhpgs rkfnttervl 
20221 qgllgpmfkn tsvgllysgc rltllrpekn gaatrvdavc thrpdpkspg Ixxexlywel 
40 20281 sxltxxixel gpytldrxsl yvngfthxxs xpttstpgts tvxxgtsgtp ssxpxxtapv 

2 0341 pllipftlnf titnlhyeen mqhpgsrkfn ttervlqgll kplfkstsvg plysgcrltl 
2 0401 Irpekhgaat gvdaictlrl dptgpgldre rlywelsqlt nsvtelgpyt ldrdslyvng 
20461 ftqrssvptt sipgtsavhl etsgtpaslp ghtapgpllv pftlnftitn lqyevdmrhp 
20521 gsrkfntter vlqgllkplf kstsvgplys gcrltllrpe krgaatgvdt icthrldpln 
45 20581 pgldreqlyw elskltrgii elgpylldrg slyvngfthr nfvpitstpg tstvhlgtse 

20641 tpsslprpiv pgpllvpftl nftitnlqye eamrhpgsrk fnttervlqg llrplfknts 
2 0701 igplysscrl tllrpekdka atrvdaicth hpdpqspgln reqlyweleq Ithgitelgp 
20761 ytldrdslyv dgfthwspip ttstpgtsiv nlgtsgipps lpettxxxpl lxpftxnxti 
2 0821 tnlxxxxxmx xpgsrkfntt ervlqgllkp lfkstsvgpl ysgcrltllr pekdgvatrv 
20881 daicthrpdp kipgldrqql ywelsqlths itelgpytld rdslyvngft qrssvpttst 
20941 pgtftvqpet setpsslpgp tatgpvllpf tlnftitnlq yeedmhrpgs rkfnttervl 
21001 qgllmplfkn tsvsslysgc rltllrpekd gaatrvdavc thrpdpkspg ldrerlywkl 
21061 sqlthgitel gpytldrhsl yvngfthqss mtttrtpdts tmhlatsrtp aslsgpttas 
21121 pllvlftinf titnlryeen mhhpgsrkfn ttervlqgll rpvfkntsvg plysgcrltl 
21181 lrpkkdgaat kvdaictyrp dpkspgldre qlywelsqlt hsitelgpyt ldrdslyvng 
21241 ftqrssvptt sipgtptvdl gtsgtpvskp gpsaaspllv lftlnftitn lryeenmqhp 
213 01 gsrkfntter vlqgllrslf kstsvgplys gcrltllrpe kdgtatgvda icthhpdpks 
21361 prldreqlyw elsqlthnit elghyaldnd slfvngfthr ssvsttstpg tptvylgask 
21421 tpasifgpsa ashllilftl nftitnlrye enrawpgsrkf nttervlqgl lrplfkntsv 
21481 gplysgsrlt llrpekdgea tgvdaicthr pdptgpgldr eqlylelsql thsitelgpy 
21541 tldrdslyvn gfthrssvpt tstgwseep ftlnftinnl rymadmgqpg slkfnitdnv 
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21601 mkhllsplfq rsslgarytg crvialrsvk ngaetrvdll ctylqplsgp glpikqvfhe 

21661 Isqqthgitr lgpysldkds lylngynepg ldeppttpkp attflpplse attamgyhlk 

21721 tltlnftisn Iqyspdmgkg satfnstegv lqhllrplfq kssmgpfylg cqliBlrpek 

21781 dgaatgvdtt ctyhpdpvgp gldiqqlywe lsqlthgvtq lgfyvldrds lfingyapqn 
5 21841 lsirgeyqin fhivnwnlsn pdptsseyit llrdiqdkvt tlykgsqlhd tfrfclvtnl 

21901 tradsvlvtvk alfBsnldps lveqvfldkt lnasfhwlgs tyqlvdihvt eraessvyqpt 

21961 sssstqhfyl nftitnlpys qdkaqpgttn yqmkmied alnqlfrnss iksyfsdcqv 

22021 stfrsvpnrh htgvdslcnf splarrvdrv aiyeeflrmt rngtqlquft ldrssvlvdg 

22 081 yspnrneplt gnsdlpfwav iliglagllg litclicgvl vttrrrkkeg eynvqqqcpg 
10 22141 yyqshldled lq 



SEQ ID NO. 2 
15 CA125 nucleic acid Genbank No. AF414442 
CDS 205. .66663 

1 aagcgttgca caattccccc aacctccata catacggcag ctcttctaga cacaggtttt 
61 cccaggtcaa atgcggggac cccagccata tctcccaccc tgagaaattt tggagtttca 
121 gggagctcag aagctctgca gaggccaccc tctctgaggg gattcttctt agacctccat 
20 181 ccagaggcaa atgttgacct gtccatgctg aaaccctcag gccttcctgg gtcatcttct 

241 cccacccgct ccttgatgac agggagcagg agcactaaag ccacaccaga aatggattca 
301 ggactgacag gagccacctt gtcacctaag acatctacag gtgcaatcgt ggtgacagaa 
361 catactctgc cctttacttc cccagataag accttggcca gtcctacatc ttcggttgtg 
421 ggaagaacca cccagtcttt gggggtgatg tcctctgctc tccctgagtc aacctctaga 
25 481 ggaatgacac actccgagca aagaaccagc ccatcgctga gtccccaggt caatggaact 

541 ccctctagga actaccctgc tacaagcatg gtttcaggat tgagttcccc aaggaccagg 
601 accagttcca cagaaggaaa ttttaccaaa gaagcatcta catacacact cactgtagag 
661 accacaagtg gcccagtcac tgagaagtac acagtcccca ctgagacctc aacaactgaa 
721 ggtgacagca cagagacccc ctgggacaca agatatattc ctgtaaaaat cacatctcca 
30 781 atgaaaacat ttgcagattc aactgcatcc aaggaaaatg ccccagtgtc tatgactcca 

841 gctgagacca cagttactga ctcacatact ccaggaagga caaacccatc atttgggaca 
901 ctttattctt ccttccttga cctatcacct aaagggaccc caaattccag aggtgaaaca 
961 agcctggaac tgattctatc aaccactgga tatcccttct cctctcctga acctggctct 
1021 gcaggacaca gcagaataag taccagtgcg cctttgtcat catctgcttc agttctcgat 
35 1081 aataaaatat cagagaccag catattctca ggccagagtc tcacctcccc tctgtctcct 

1141 ggggtgcccg aggccagagc cagcacaatg cccaactcag ctatcccttt ttccatgaca 
1201 ctaagcaatg cagaaacaag tgccgaaagg gtcagaagca caatttcctc tctggggact 
1261 ccatcaatat ccacaaagca gacagcagag actatcctta ccttccatgc cttcgctgag 
1321 accatggata tacccagcac ccacafcagcc aagactttgg cttcagaatg gttgggaagt 
40 1381 ccaggtaccc ttggtggcac cagcacttca gcgctgacaa ccacatctcc atctaccact 

1441 ttagtctcag aggagaccaa cacccatcac tccacgagtg gaaaggaaac agaaggaact 
1501 ttgaatacat ctatgactcc acttgagacc tctgctcctg gagaagagtc cgaaatgact 
1561 gccaccttgg tccccactct aggttttaca actcttgaca gcaagatcag aagtccatct 
1621 caggtctctt catcccaccc aacaagagag ctcagaacca caggcagcac ctctgggagg 
45 1681 cagagttcca gcacagctgc ccacgggagc tctgacatcc tgagggcaac cacttccagc 

1741 acctcaaaag catcatcatg gaccagtgaa agcacagctc agcaatttag tgaaccccag 
1801 cacacacagt gggtggagac aagtcctagc atgaaaacag agagaccccc agcatcaacc 
1861 agtgtggcag cccctatcac cacttctgtt ccctcagtgg tctctggctt caccaccctg 
1921 aagaccagct ccacaaaagg gatttggctt gaagaaacat ctgcagacac actcatcgga 
50 1981 gaatccacag ctggcccaac cacccatcag tttgctgttc ccactgggat ttcaatgaca 

2041 ggaggcagca gcaccagggg aagccagggc acaacccacc tactcaccag agccacagca 
2101 tcatctgaga catccgcaga tttgactctg gccacgaacg gtgtcccagt ctccgtgtct 
2161 ccagcagtga gcaagacggc tgctggctca agtcctccag gagggacaaa gccatcatat 
2221 acaatggttt cttctgtcat ccctgagaca tcatctctac agtcctcagc tttcagggaa 
55 2281 ggaaccagcc tgggactgac tccattaaac actagacafcc ccttctcttc ccctgaacca 

2341 gactctgcag gacacaccaa gataagcacc agcattcctc tgttgtcatc tgcttcagtt 
2401 cttgaggata aagtgtcagc gaccagcaca ttctcacacc acaaagccac ctcatctatt 
2461 accacaggga ctcctgaaat ctcaacaaag acaaagccca gctcagccgt tctttcctcc 
2521 atgaccctaa gcaatgcagc aacaagtcct gaaagagtca gaaatgcaac ttcccctctg 
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2581 actcatccat ctccatcagg ggaagagaca gcagggagtg tcctcactct cagcacctct 
2641 gctgagacta cagactcacc taacatccac ccaactggga cactgacttc agaatcgtca 
2701 gagagtccta gcactctcag cctcccaagt gtctctggag tcaaaaccac attttcttca 
2761 tctactcctt ccactcatct atttactagt ggagaagaaa cagaggaaac ttcgaatcca 
5 2821 tctgtgtctc aacctgagac ttctgtttcc agagtaagga ccaccttggc cagcacctct 

2881 gtccctaccc cagtattccc caccatggac acctggccta cacgttcagc tcagttctct 
2941 tcatcccacc tagtgagtga gctcagagct acgagcagta cctcagttac aaactcaact 
3001 ggttcagctc ttcctaaaat atctcacctc actgggacgg caacaatgtc acagaccaat 
3061 agagacacgt ttaatgactc tgctgcaccc caaagcacaa cttggccaga gactagtccc 
10 3121 agattcaaga cagggttacc ttcagcaaca accactgttt caacctctgc cacttctctc 

3181 tctgctactg taatggtctc taaattcact tctccagcaa ctagttccat ggaagcaact 
3241 tctatcaggg aaccatcaac aaccatcctc acaacagaga ccacgaatgg cccaggctct 
3301 atggctgtgg cttctaccaa catcccaatt ggaaagggct acattactga aggaagattg 
3361 gacacaagcc atctgcccat tggaaccaca gcttcctctg agacatctat ggattttacc 
15 3421 atggccaaag aaagtgtctc aatgtcagta tctccatctc agtccatgga tgctgctggc 

3481 tcaagcactc caggaaggac aagccaattc gttgacacat tttctgatga tgtctatcat 
3541 ttaacatcca gagaaattac aatacctaga gatggaacaa gctcagctct gactccacaa 
3601 atgactgcaa ctcaccctcc atctcctgat cctggctctg ctagaagcac ctggcttggc 
3661 atcttgtcct catctccttc ttctcctact cccaaagtca caatgagctc cacattttca 
20 3721 actcagagag tcaccacaag catgataatg gacacagttg aaactagtcg gtggaacatg 

3781 cccaacttac cttccacgac ttccctgaca ccaagtaata ttccaacaag tggtgccata 
3841 ggaaaaagca ccctggttcc cttggacact ccatctccag ccacatcatt ggaggcatca 
3901 gaagggggac ttccaaccct cagcacctac cctgaatcaa caaacacacc cagcatccac 
3961 ctcggagcac acgctagttc agaaagtcca agcaccatca aacttaccat ggcttcagta 
25 4021 gtaaaacctg gctcttacac acctctcacc ttcccctcaa tagagaccca cattcatgta 

4081 tcaacagcca gaatggctta ctcttctggg tcttcacctg agatgacagc tcctggagag 
4141 actaacactg gtagtacctg ggaccccacc acctacatca ccactacgga tcctaaggat 
4201 acaagttcag ctcaggtctc tacaccccac tcagtgagga cactcagaac cacagaaaac 
4261 catccaaaga cagagtccgc caccccagct gcttactctg gaagtcctaa aatctcaagt 
30 4321 tcacccaatc tcaccagtcc ggccacaaaa gcatggacca tcacagacac aactgaacac 

4381 tccactcaat tacattacac aaaattggca gaaaaatcat ctggatttga gacacagtca 
4441 gctccaggac ctgtctctgt agtaatccct acctccccta ccattggaag cagcacattg 
4501 gaactaactt ctgatgtccc aggggaaccc ctggtccttg ctcccagtga gcagaccaca 
4561 atcactctcc ccatggcaac atggctgagt accagtttga cagaggaaat ggcttcaaca 
35 4621 gaccttgata tttcaagtcc aagttcaccc atgagtacat ttgctatttt tccacctatg 

4681 tccacacctt ctcatgaact ttcaaagtca gaggcagata ccagtgccat tagaaataca 
4741 gattcaacaa cgttggatca gcacctagga atcaggagtt tgggcagaac tggggactta 
4801 acaactgttc ctatcacccc actgacaacc acgtggacca gtgtgattga acactcaaca 
4861 caagcacagg acaccctttc tgcaacgatg agtcctactc acgtgacaca gtcactcaaa 
40 4921 gatcaaacat ctataccagc ctcagcatcc ccttcccatc ttactgaagt ctaccctgag 

4981 ctcgggacac aagggagaag ctcctctgag gcaaccactt tttggaaacc atctacagac 
5041 acactgtcca gagagattga gactggccca acaaacattc aatccactcc acccatggac 
5101 aacacaacaa cagggagcag tagtagtgga gtcaccctgg gcatagccca ccttcccata 
5161 ggaacatcct ccccagctga gacatccaca aacatggcac tggaaagaag aagttctaca 
45 52 21 gccactgtct ctatggctgg gacaatggga ctccttgtta ctagtgctcc aggaagaagc 

5281 atcagccagt cattaggaag agtttcctct gtcctttctg agtcaactac tgaaggagtc 
5341 acagattcta gtaagggaag cagcccaagg ctgaacacac agggaaatac agctctctcc 
5401 tcctctcttg aacccagcta tgctgaagga agccagatga gcacaagcat ccctctaacc 
5461 tcatctccta caactcctga tgtggaattc atagggggca gcacattttg gaccaaggag 
50 5521 gtcaccacag ttatgacctc agacatctcc aagtcttcag caaggacaga gtccagctca 

5581 gctaccctta tgtccacagc tttgggaagc actgaaaata caggaaaaga aaaactcaga 
5641 actgcctcta tggatcttcc atctccaact ccatcaatgg aggtgacacc atggatttct 
5701 ctcactctca gtaatgcccc caataccaca gattcacttg acctcagcca tggggtgcac 
5761 accagctctg cagggacttt ggccactgac aggtcattga atactggtgt cactagagcc 
55 5821 tccagattgg aaaacggctc tgatacctct tctaagtccc tgtctatggg aaacagcact 

5881 cacacttcca tgactgacac agagaagagt gaagtgtctt cttcaatcca tccccgacct 
5941 gagacctcag ctcctggagc agagaccact ttgacttcca ctcctggaaa cagggccata 
6001 agcttaacat tgcctttttc atccattcca gtggaagaag tcatttctac aggcataacc 
6061 tcaggaccag acatcaactc agcacccatg acacattctc ccatcacccc accaacaatt 
6121 gtatggacca gtacaggcac aattgaacag tccactcaac cactacatgc agtttcttca 
6181 gaaaaagttt ctgtgcagac acagtcaact ccatatgtca actctgtggc agtgtctgct 
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6241 tcccctaccc atgagaattc agtctcttct ggaagcagca catcctctcc atattcctca 
6301 gcctcacttg aatccttgga ttccacaatc agtaggagga atgcaatcac ttcctggcta 
6361 tgggacctca ctacatctct ccccactaca acttggccaa gtactagttt atctgaggca 
6421 ctgtcctcag gccattctgg ggtttcaaac ccaagttcaa ctacgactga atttccactc 
5 6481 ttttcagctg catccacatc tgctgctaag caaagaaatc cagaaacaga gacccatggt 

6541 ccccagaata cagccgcgag tactttgaac actgatgcat cctcggtcac aggtctttct 
6601 gagactcctg tgggggcaag tatcagctct gaagtccctc ttccaatggc cataacttct 
6661 agatcagatg tttctggcct tacatctgag agtactgcta acccgagttt aggcacagcc 
6721 tcttcagcag ggaccaaatt aactaggaca atatccctgc ccacttcaga gtctttggtt 
10 6781 tcctttagaa tgaacaagga tccatggaca gtgtcaatcc ctttggggtc ccatccaact 

6841 actaatacag aaacaagcat cccagtaaac agcgcaggtc cacctggctt gtccacagta 
6901 gcatcagatg taattgacac accttcagat ggggctgaga gtattcccac tgtctccttt 
6961 tccccctccc ctgatactga agtgacaact atctcacatt tcccagaaaa gacaactcat 
7021 tcatttagaa ccatttcatc tctcactcat gagttgactt caagagtgac acctattcct 

15 7081 ggggattgga tgagttcagc tatgtctaca aagcccacag gagccagtcc ctccattaca 

7141 ctgggagaga gaaggacaat cacctctgct gctccaacca cttcccccat agttctcact 
7201 gctagtttca cagagaccag cacagtttca ctggataatg aaactacagt aaaaacctca 
7261 gatatccttg acgcacggaa aacaaatgag ctcccctcag atagcagttc ttcttctgat 
7321 ctgatcaaca cctccatagc ttcttcaact atggatgtca ctaaaacagc ctccatcagt 

20 73 81 cccactagca tctcaggaat gacagcaagt tcctccccat ctctcttctc ttcagataga 

7441 ccccaggttc ccacatctac aacagagaca aatacagcca cctctccatc tgtttccagt 
7501 aacacctatt ctcttgatgg gggctccaat gtgggtggca ctccatccac tttaccaccc 
7561 tttacaatca cccaccctgt cgagacaagc tcggccctat tagcctggtc tagaccagta 
7621 agaactttca gcaccatggt cagcactgac actgcctccg gagaaaatcc tacctctagc 

25 7681 aattctgtgg tgacttctgt tccagcacca ggtacatggg ccagtgtagg cagtactact 

7741 gacttacctg ccatgggctt tctcaagaca agtcctgcag gagaggcaca ctcacttcta 
7801 gcatcaacta ttgaaccagc cactgccttc actccccatc tctcagcagc agtggtcact 
7861 ggatccagtg ctacatcaga agccagtctt ctcactacga gtgaaagcaa agccattcat 
7921 tcttcaccac agaccccaac tacacccacc tctggagcaa actgggaaac ttcagctact 

30 7981 cctgagagcc ttttggtagt cactgagact tcagacacaa cacttacctc aaagattttg 

8041 gtcacagata ccatcttgtt ttcaactgtg tccacgccac cttctaaatt tccaagtacg 
8101 gggactctgt ctggagcttc cttccctact ttactcccgg acactccagc catccctctc 
8161 actgccactg agccaacaag ttcattagct acatcctttg attccacccc actggtgact 
8221 atagcttcgg atagtcttgg cacagtccca gagactaccc tgaccatgtc agagacctca 

35 8281 aatggtgatg cactggttct taagacagta agtaacccag ataggagcat ccctggaatc 

8341 actatccaag gagtaacaga aagtccactc catccttctt ccacttcccc ctctaagatt 
8401 gttgctccac ggaatacaac ctatgaaggt tcgatcacag tggcactttc tactttgcct 
8461 gcgggaacta ctggttccct tgtattcagt cagagttctg aaaactcaga gacaacggct 
8521 ttggtagact catcagctgg gcttgagagg gcatctgtga tgccactaac cacaggaagc 

40 8581 cagggtatgg ctagctctgg aggaatcaga agtgggtcca ctcactcaac tggaaccaaa 

8641 acattttctt ctctccctct gaccatgaac ccaggtgagg ttacagccat gtctgaaatc 
8701" accacgaaca gactgacagc tactcaatca acagcaccca aagggatacc tgtgaagccc 
8761 accagtgctg agtcaggcct cctaacacct gtctctgcct cctcaagccc atcaaaggcc 
8821 tttgcctcac tgactacagc tcccccatca acttggggga tcccacagtc taccttgaca 

45 8881 tttgagtttt ctgaggtccc aagtttggat actaagtccg cttctttacc aactcctgga 

8941 cagtccctga acaccattcc agactcagat gcaagcacag catcttcctc actgtccaag 
9001 tctccagaaa aaaacccaag ggcaaggatg atgacttcca caaaggccat aagtgcaagc 
9061 tcatttcaat caacaggttt tactgaaacc cctgagggat ctgcctcccc ttctatggca 
9121 gggcatgaac ccagagtccc cacttcagga acaggggacc ctagatatgc ctcagagagc 

50 9181 atgtcttatc cagacccaag caaggcatca tcagctatga catcgacctc tcttgcatca 

9241 aaactcacaa cfcctcttcag cacaggtcaa gcagcaaggt ctggttctag ttcctctccc 
9301 ataagcctat ccactgagaa agaaacaagc ttcctttccc ccactgcatc cacctccaga 
9361 aagacttcac tatttcttgg gccttccatg gcaaggcagc ccaacatatt ggtgcatctt 
9421 cagacttcag ctctgacact ttctccaaca tccactctaa atatgtccca ggaggagcct 

55 9481 cctgagttaa cctcaagcca gaccattgca gaagaagagg gaacaacagc tgaaacacag 

9541 acgttaacct tcacaccatc tgagacccca acatccttgt tacctgtctc ttctcccaca 
9601 gaacccacag ccagaagaaa gagttctcca gaaacatggg caagctctat ttcagttcct 
9661 gccaagacct ccttggttga aacaactgat ggaacgctag tgaccaccat aaagatgtca 
9721 agccaggcag cacaaggaaa ttccacgtgg cctgccccag cagaggagac ggggaccagt 

60 9781 ccagcaggca catccccagg aagcccagaa gtgtctacca ctctcaaaat catgagctcc 

9841 aaggaaccca gcatcagccc agagatcagg tccactgtgc gaaattctcc ttggaagact 
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9901 ccagaaacaa ctgttcccat ggagaccaca gtggaaccag tcacccttca gtccacagcc 
9961 ctaggaagtg gcagcaccag catctctcac ctgcccacag gaaccacatc accaaccaag 
10021 tcaccaacag aaaatatgtt ggctacagaa agggtctccc tctccccatc cccacctgag 
10081 gcttggacca acctttattc tggaactcca ggagggacca ggcagtcact ggccacaatg 
5 10141 tcctctgtct ccctagagtc accaactgct agaagcatca cagggactgg tcagcaaagc 

10201 agtccagaac tggtttcaaa gacaactgga atggaattct ctatgtggca tggctctact 
10261 ggagggacca caggggacac acatgtctct ctgagcacat cttccaatat ccttgaagac 
10321 cctgtaacca gcccaaactc tgtgagctca ttgacagata aatccaaaca taaaaccgag 
10381 acatgggtaa gcaccacagc cattccctcc actgtcctga ataataagat aatggcagct 
10 10441 gaacaacaga caagtcgatc tgtggatgag gcttattcat caactagttc ttggtcagat 

10501 cagacatctg ggagtgacat cacccttggt gcatctcctg atgtcacaaa cacattatac 
10S61 atcacctcca cagcacaaac cacctcacta gtgtctctgc cctctggaga ccaaggcatt 
10621 acaagcctca ccaatccctc aggaggaaaa acaagctctg cgtcatctgt cacatctcct 
10681 tcaatagggc ttgagactct gagggccaat gtaagtgcag tgaaaagtga cattgcccct 
15 10741 actgctgggc atctatctca gacttcatct cctgcggaag tgagcatcct ggacgtaacc 

10801 acagctccta ctccaggtat ctccaccacc atcaccacca tgggaaccaa ctcaatctca 
10861 actaccacac ccaacccaga agtgggtatg agtaccatgg acagcacccc ggccacagag 
10921 aggcgcacaa cttctacaga acacccttcc acctggtctt ccacagctgc atcagattcc 
10981 tggactgtca cagacatgac ttcaaacttg aaagttgcaa gatctcctgg aacaatttcc 
20 H041 acaatgcata caacttcatt cttagcctca agcactgaat tagactccat gtctactccc 

11101 catggccgta taactgtcat tggaaccagc ctggtcactc catcctctga tgcttcagct 
11161 gtaaagacag agaccagtac aagtgaaaga acattgagtc cttcagacac aactgcatct 
11221 actcccatct caactttttc tcgtgtccag aggatgagca tctcagttcc tgacatttta 
11281 agtacaagtt ggactcccag tagtacagaa gcagaagatg tgcctgtttc aatggtttct 
25 11341 acagatcatg ctagtacaaa gactgaccca aatacgcccc tgtccacttt tctgtttgat 

11401 tctctgtcca ctcttgactg ggacactggg agatctctgt catcagccac agccactacc 
11461 tcagctcctc agggggccac aactccccag gaactcactt tggaaaccat gatcagccca 
11521 gctacctcac agttgccctt ctctataggg cacattacaa gtgcagtcac accagctgca 
11581 atggcaagga gctctggagt tactttttca agaccagatc ccacaagcaa aaaggcagag 
11641 cagacttcca ctcagcttcc caccaccact tctgcacatc cagggcaggt gcccagatca 
11701 gcagcaacaa ctctggatgt gatcccacac acagcaaaaa ctccagatgc aacttttcag 
11761 agacaagggc agacagctct tacaacagag gcaagagcta catctgactc ctggaatgag 
11821 aaagaaaaat caaccccaag tgcaccttgg atcactgaga tgatgaattc tgtctcagaa 
11881 gataccatca aggaggttac cagctcctcc agtgtattaa aggaccctga atacgctgga 
35 11941 cataaacttg gaatctggga cgacttcatc cccaagtttg gaaaagcagc ccatatgaga 

12001 gagttgcccc ttctgagtcc accacaggac aaagaggcaa ttcacccttc tacaaacaca 
12061 gtagagacca caggctgggt cacaagttcc gaacatgctt ctcattccac tatcccagcc 
12121 cactcagcgt catccaaact cacatctcca gtggttacaa cctccaccag ggaacaagca 
12181 atagtttcta tgtcaacaac cacatggcca gagtctacaa gggctagaac agagcctaat 
40 12241 tccttcttga ctattgaact gagggacgtc agcccttaca tggacaccag ctcaaccaca 

12301 caaacaagta ttatctcttc cccaggttcc actgcgatca ccaaggggcc tagaacagaa 
12361 attacctcct ctaagagaat atccagctca ttccttgccc agtctatgag gtcgtcagac 
12421 agcccctcag aagccatcac caggctgtct aactttcctg ccatgacaga atctggagga 
12481 atgatccttg ctatgcaaac aagtccacct ggcgctacat cactaagtgc acctactttg 
45 12541 gatacatcag ccacagcctc ctggacaggg actccactgg ctacgactca gagatttaca 

12601 tactcagaga agaccactct ctttagcaaa ggtcctgagg atacatcaca gccaagccct 
12661 ccctctgtgg aagaaaccag ctcttcctct tccctggtac ctatccatgc tacaacctcg 
12721 ccttccaata ttttgttgac atcacaaggg cacagtccct cctctactcc acctgtgacc 
12781 tcagttttct tgtctgagac ctctggcctg gggaagacca cagacatgtc gaggataagc 
50 12841 ttggaacctg gcacaagttt acctcccaat ttgagcagta cagcaggtga ggcgttatcc 

12901 acttatgaag cctccagaga tacaaaggca attcatcatt ctgcagacac agcagtgacg 
12961 aatatggagg caaccagttc tgaatattct cctatcccag gccatacaaa gccatccaaa 
13021 gccacatctc cattggttac ctcccacatc atgggggaca tcacttcttc cacatcagta 
13081 tttggctcct ccgagaccac agagattgag acagtgtcct ctgtgaacca gggacttcag 
55 13141 gagagaagca catcccaggt ggccagctct gctacagaga caagcactgt cattacccat 

13201 gtgtctagtg gtgatgctac tactcatgtc accaagacac aagccacttt ctctagcgga 
13261 acatccatct caagccctca tcagtttata acttctacca acacatttac agatgtgagc 
13321 accaacccct ccacctctct gataatgaca gaatcttcag gagtgaccat caccacccaa 
13381 acaggtccta ctggagctgc aacacagggt ccatatctct tggacacatc aaccatgcct 
13441 tacttgacag agactccatt agctgtgact ccagatttta tgcaatcaga gaagaccact 
13501 ctcataagca aaggtcccaa ggatgtgacc tggacaagcc ctccctctgt ggcagaaacc 
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13561 agctatccct cttccctgac acctttcttg gtcacaacca tacctcctgc cacttccacg 
13621 ttacaagggc aacatacatc ctctcctgtt tctgcgactt cagttcttac ctctggactg 
13681 gtgaagacca cagatatgtt gaacacaagc atggaacctg tgaccaattc acctcaaaat 
13741 ttgaacaatc catcaaatga gatactggcc actttggcag ccaccacaga tatagagact 
5 13801 attcatcctt ccataaacaa agcagtgacc aatatgggga ctgccagttc agcacatgta 

13861 ctgcattcca ctctcccagt cagctcagaa ccatctacag ccacatctcc aatggttcct 
13921 gcctccagca tgggggacgc tcttgcttct atatcaatac ctggttctga gaccacagac 
13981 attgagggag agccaacatc ctccctgact gctggacgaa aagagaacag caccctccag 
14041 gagatgaact caactacaga gtcaaacatc atcctctcca atgtgtctgt gggggctatt 

10 14101 actgaagcca caaaaatgga agtcccctct tttgatgcaa cattcatacc aactcctgct 

14161 cagtcaacaa agttcccaga tattttctca gtagccagca gtagactttc aaactctcct 
14221 cccatgacaa tatctaccca catgaccacc acccagacag ggtcttctgg agctacatca 
14281 aagattccac ttgccttaga cacatcaacc ttggaaacct cagcagggac tccatcagtg 
14341 gtgactgagg ggtttgccca ctcaaaaata accactgcaa tgaacaatga tgtcaaggac 

15 14401 gtgtcacaga caaaccctcc ctttcaggat gaagccagct ctccctcttc tcaagcacct 

14461 gtccttgtca caaccttacc ttcttctgtt gctttcacac cgcaatggca cagtacctcc 
14521 tctcctgttt ctatgtcctc agttcttact tcttcactgg taaagaccgc aggcaaggtg 
14581 gatacaagct tagaaacagt gaccagttca cctcaaagta tgagcaacac tttggatgac 
14641 atatcggtca cttcagcagc caccacagat atagagacaa cgcatccttc cataaacaca 

20 14701 gtagttacca atgtggggac caccggttca gcatttgaat cacattctac tgtctcagct 

14761 tacccagagc catctaaagt cacatctcca aatgttacca cctccaccat ggaagacacc 
14821 acaatttccc gatcaatacc taaatcctct aagactacaa gaactgagac tgagacaact 
14881 tcctccctga ctcctaaact gagggagacc agcatctccc aggagatcac ctcgtccaca 
14941 gagacaagca ctgttcctta caaagagctc actggtgcca ctaccgaggt atccaggaca 

25 15001 gatgtcactt cctctagcag tacatccttc cctggccctg atcagtccac agtgtcacta 

15061 gacatctcca cagaaaccaa caccaggctg tctacctccc caataatgac agaatctgca 
15121 gaaataacca tcaccaccca aacaggtcct catggggcta catcacagga tacttttacc 
15181 atggacccat caaatacaac cccccaggca gggatccact cagctatgac tcatggattt 
15241 tcacaattgg atgtgaccac tcttatgagc agaattccac aggatgtatc atggacaagt 

30 15301 cctccctctg tggataaaac cagctccccc tcttcctttc tgtcctcacc tgcaatgacc 

15361 acaccttccc tgatttcttc taccttacca gaggataagc tctcctctcc tatgacttca 
15421 cttctcacct ctggcctagt gaagattaca gacatattac gtacacgctt ggaacctgtg 
15481 accagctcac ttccaaattt cagcagcacc tcagataaga tactggccac ttctaaagac 
15541 agtaaagaca caaaggaaat ttttccttct ataaacacag aagagaccaa tgtgaaagcc 

35 15601 aacaactctg gacatgaatc ccattcccct gcactggctg actcagagac acccaaagcc 

15661 acaactcaaa tggttatcac caccactgtg ggagatccag ctccttccac atcaatgcca 
15721 gtgcatggtt cctctgagac tacaaacatt aagagagagc caacatattt cttgactcct 
15781 agactgagag agaccagtac ctctcaggag tccagctttc ccacggacac aagttttcta 
15841 ctttccaaag tccccactgg tactattact gaggtctcca gtacaggggt caactcttct 

40 15901 agcaaaattt ccaccccaga ccatgataag tccacagtgc cacctgacac cttcacagga 

15961 gagatcccca gggtcttcac ctcctctatt aagacaaaat ctgcagaaat gacgatcacc 
16021 acccaagcaa gtcctcctga gtctgcatcg cacagtaccc ttcccttgga cacatcaacc 
16081 acactttccc agggagggac tcattcaact gtgactcagg gattcccata ctcagaggtg 
16141 accactctca tgggcatggg tcctgggaat gtgtcatgga tgacaactcc ccctgtggaa 

45 16201 gaaaccagct ctgtgtcttc cctgatgtct tcacctgcca tgacatcccc ttctcctgtt 

16261 tcctccacat caccacagag catcccctcc tctcctcttc ctgtgactgc acttcctact 
16321 tctgttctgg tgacaaccac agatgtgttg ggcacaacaa gcccagagtc tgtaaccagt 
16381 tcacctccaa atttgagcag catcactcat gagagaccgg ccacttacaa agacactgca 
16441 cacacagaag ccgccatgca tcattccaca aacaccgcag tgaccaatgt agggacttcc 

50 16501 gggtctggac ataaatcaca atcctctgtc ctagctgact cagagacatc gaaagccaca 

16561 cctctgatga gtaccacctc caccctgggg gacacaagtg tttccacatc aactcctaat 
16621 atctctcaga ctaaccaaat tcaaacagag ccaacagcat ccctgagccc tagactgagg 
16681 gagagcagca cgtctgagaa gaccagctca acaacagaga caaatactgc cttttcttat 
16741 gtgcccacag gtgctattac tcaggcctcc agaacagaaa tctcctctag cagaacatcc 

55 16801 atctcagacc ttgatcggcc cacaatagca cccgacatct ccacaggaat gatcaccagg 

16861 ctcttcacct cccccatcat gacaaaatct gcagaaatga ccgtcaccac tcaaacaact 
16921 actcctgggg ctacatcaca gggtatcctt ccttgggaca catcaaccac acttttccag 
16981 ggagggactc attcaaccgt gtctcaggga ttcccacact cagagataac cactcttcgg 
17041 agcagaaccc ctggagatgt gtcatggatg acaactcccc ctgtggaaga aaccagctct 

60 17101 gggttttccc tgatgtcacc ttccatgaca tccccttctc ctgtttcctc cacatcacca 

17161 gagagcatcc cctcctctcc tctccctgtg actgcacttc ttacttctgt tctggtgaca 
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17221 accaccaatg tattgggcac aacaagccca gagaccgtaa cgagttcacc tccaaattta 
17281 agcagcccca cacaggagag actgaccact tacaaagaca ctgcgcacac agaagccatg 
17341 catgcttcca tgcatacaaa cactgcagtg gccaacgtcg ggacctccat ttctggacat 
17401 gaatcacaat cttctgtccc agctgattca cacacatcca aagccacatc tccaatgggt 
17461 atcaccttcg ccatggggga tacaagtgtt tctacatcaa ctcctgcctt ctttgagact 
17521 agaattcaga ctgaatcaac atcctctttg attcctggat taagggacac caggacgtct 
17581 gaggagatca acactgtgac agagaccagc actgtccttt cagaagtgcc cactactact 
17641 actactgagg tctccaggac agaagttatc acttccagca gaacaaccat ctcagggcct 
17701 gatcattcca aaatgtcacc ctacatctcc acagaaacca tcaccaggct ctccactttt 
17761 ccttttgtaa caggatccac agaaatggcc atcaccaacc aaacaggtcc tatagggact 
17821 atctcacagg ctacccttac cctggacaca tcaagcacag cttcctggga agggactcac 
17881 tcacctgtga ctcagagatt tccacactca gaggagacca ctactatgag cagaagtact 
17941 aagggcgtgt catggcaaag ccctccctct gtggaagaaa ccagttctcc ttcttcccca 
18001 gtgcctttac ctgcaataac ctcacattca tctctttatt ccgcagtatc aggaagtagc 
18061 cccacttctg ctctccctgt gacttccctt ctcacctctg gcaggaggaa gaccatagac 
18121 atgttggaca cacactcaga acttgtgacc agctccttac caagtgcaag tagcttctca 
18181 ggtgagatac tcacttctga agcctccaca aatacagaga caattcactt ttcagagaac 
18241 acagcagaaa ccaatatggg gaccaccaat tctatgcata aactacattc ctctgtctca 
18301 atccactccc agccatccgg acacacacct ccaaaggtta ctggatctat gatggaggac 
18361 gctattgttt ccacatcaac acctggttct cctgagacta aaaatgttga cagagactca 
18421 acatcccctc tgactcctga actgaaagag gacagcaccg ccctggtgat gaactcaact 
18481 acagagtcaa acactgtttt ctccagtgtg tccctggatg ctgctactga ggtctccagg 
18541 gcagaagtca cctactatga tcctacattc atgccagctt ctgctcagtc aacaaagtcc 
18601 ccagacattt cacctgaagc cagcagcagt cattctaact ctcctccctt gacaatatct 
25 18661 acacacaaga ccatcgccac acaaacaggt ccttctgggg tgacatctct tggccaactg 

18721 accctggaca catcaaccat agccacctca gcaggaactc catcagccag aactcaggat 
18781 tttgtagatt cagaaacaac cagtgtcatg aacaatgatc tcaatgatgt gttgaagaca 
18841 agccctttct ctgcagaaga agccaactct ctctcttctc aggcacctct ccttgtgaca 
18901 acctcacctt ctcctgtaac ttccacattg caagagcaca gtacctcctc tcttgtttct 
30 18961 gtgacctcag tacccacccc tacactggcg aagatcacag acatggacac aaacttagaa 

19021 cctgtgactc gttcacctca aaatttaagg aacaccttgg ccacttcaga agccaccaca 
19081 gatacacaca caatgcatcc ttctataaac acagcaatgg ccaatgtggg gaccaccagt 
19141 tcaccaaatg aattctattt tactgtctca cctgactcag acccatataa agccacatcc 
19201 gcagtagtta tcacttccac ctcgggggac tcaatagttt ccacatcaat gcctagatcc 
35 19261 tctgcgatga aaaagattga gtctgagaca actttctccc tgatatttag actgagggag 

19321 actagcacct cccagaaaat tggctcatcc tcagacacaa gcacggtctt tgacaaagca 
19381 ttcactgctg ctactactga ggtctccaga acagaactca cctcctctag cagaacatcc 
19441 atccaaggca ctgaaaagcc cacaatgtca ccggacacct ccacaagatc tgtcaccatg 
19501 ctttctactt ttgctggcct gacaaaatcc gaagaaagga ccattgccac ccaaacaggt 
40 19561 cctcataggg cgacatcaca gggtaccctt acctgggaca catcaatcac aacctcacag 

19621 gcagggaccc actcagctat gactcatgga ttttcacaat tagatttgtc cactcttacg 
19681 agtagagttc ctgagtacat atcagggaca agcccaccct ctgtggaaaa aaccagctct 
19741 tcctcttccc ttctgtcttt accagcaata acctcaccgt cccctgtacc tactacatta 
19801 ccagaaagta ggccgtcttc tcctgttcat ctgacttcac tccccacctc tggcctagtg 
45 19861 aagaccacag atatgctggc atctgtggcc agtttacctc caaacttggg cagcacctca 

19921 cataagatac cgactacttc agaagacatt aaagatacag agaaaatgta tccttccaca 
19981 aacatagcag taaccaatgt ggggaccacc acttctgaaa aggaatctta ttcgtctgtc 
2 0041 ccagcctact cagaaccacc caaagtcacc tctccaatgg ttacctcttt caacataagg 
2 0101 gacaccattg tttccacatc catgcctggc tcctctgaga ttacaaggat tgagatggag 
20161 tcaacattct ccgtggctca tgggctgaag ggaaccagca cctcccagga ccccatcgta 
2 0221 tccacagaga aaagtgctgt ccttcacaag ttgaccactg gtgctactga gacctctagg 
2 0281 acagaagttg cctcttctag aagaacatcc attccaggcc ctgatcattc cacagagtca 
20341 ccagacatct ccactgaagt gatccccagc ctgcctatct cccttggcat tacagaatct 
20401 tcaaatatga ccatcatcac tcgaacaggt cctcctcttg gctctacatc acagggcaca 
2 0461 tttaccttgg acacaccaac tacatcctcc agggcaggaa cacactcgat ggcgactcag 
20521 gaatttccac actcagaaat gaccactgtc atgaacaagg accctgagat tctatcatgg 
20581 acaatccctc cttctataga gaaaaccagc ttctcctctt ccctgatgcc ttcaccagcc 
20641 atgacttcac ctcctgtttc ctcaacatta ccaaagacca ttcacaccac tccttctcct 
20701 atgacctcac tgctcacccc tagcctagtg atgaccacag acacattggg cacaagccca 
2 0761 gaacctacaa ccagttcacc tccaaatttg agcagtacct cacatgtgat actgacaaca 
20821 gatgaagaca ccacagctat agaagccatg catccttcca caagcacagc agcgactaat 
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20881 gtggaaacca cctgttctgg acatgggtca caatcctctg tcctaactga ctcagaaaaa 
20941 accaaggcca cagctccaat ggataccacc tccaccatgg ggcatacaac tgtttccaca 
21001 tcaatgtctg tttcctctga gactacaaaa attaagagag agtcaacata ttccttgact 
21061 cctggactga gagagaccag catttcccaa aatgccagct tttccactga cacaagtatt 
5 21121 gttctttcag aagtccccac tggtactact gctgaggtct ccaggacaga agtcacctcc 

21181 tctggtagaa catccatccc tggcccttct cagtccacag ttttgccaga aatatccaca 
21241 agaacaatga caaggctctt tgcctcgccc accatgacag aatcagcaga aatgaccatc 
21301 cccactcaaa caggtccttc tgggtctacc tcacaggata cccttacctt ggacacatcc 
21361 accacaaagt cccaggcaaa gactcattca actttgactc agagatttcc acactcagag 
10 21421 atgaccactc tcatgagcag aggtcctgga gatatgtcat ggcaaagctc tccctctctg 

21481 gaaaatccca gctctctccc ttccctgctg tctttacctg ccacaacctc acctcctccc 
21541 atttcctcca cattaccagt gactatctcc tcctctcctc ttcctgtgac ttcacttctc 
21601 acctctagcc cggtaacgac cacagacatg ttacacacaa gcccagaact tgtaaccagt 
21661 tcacctccaa agctgagcca cacttcagat gagagactga ccactggcaa ggacaccaca 
1$ 21721 aatacagaag ctgtgcatcc ttccacaaac acagcagcgt ccaatgtgga gattcccagc 

21781 tttggacatg aatccccttc ctctgcctta gctgactcag agacatccaa agccacatca 
21841 ccaatgttta ttacctccac ccaggaggat acaactgttg ccatatcaac ccctcacttc 
21901 ttggagacta gcagaattca gaaagagtca atttcctccc tgagccctaa attgagggag 
21961 acaggcagtt ctgtggagac aagctcagcc atagagacaa gtgctgtcct ttctgaagtg 
20 22 021 tccattggtg ctactactga gatctccagg acagaagtca cctcctctag cagaacatcc 

22081 atctctggtt ctgctgagtc cacaatgttg ccagaaatat ccaccacaag aaaaatcatt 
22141 aagttcccta cttcccccat cctggcagaa tcatcagaaa tgaccatcaa gacccaaaca 
22201 agtcctcctg ggtctacatc agagagtacc tttacattag acacatcaac cactccctcc 
22261 ttggtaataa cccattcgac tatgactcag agattgccac actcagagat aaccactctt 
25 22321 gtgagtagag gtgctgggga tgtgccacgg cccagctctc tccctgtgga agaaacaagc 

22381 cctccatctt cccagctgtc tttatctgcc atgatctcac cttctcctgt ttcttccaca 
22441 ttaccagcaa gtagccactc ctcttctgct tctgtgactt cacctctcac accaggccaa 
22 501 gtgaagacta ctgaggtgtt ggacgcaagt gcagaacctg aaaccagttc acctccaagt 
22561 ttgagcagca cctcagttga aatactggcc acctctgaag tcaccacaga tacggagaaa 
22621 attcatcctt tcccaaacac ggcagtaacc aaagttggaa cttccagttc tggacatgaa 
22 681 tccccttcct ctgtcctacc tgactcagag acaaccaaag ccacatcggc aatgggtacc 
22741 atctccatta tgggggatac aagtgtttct acattaactc ctgccttatc taacactagg 
22801 aaaattcagt cagagccagc ttcctcactg accaccagat tgagggagac cagcacctct 
22861 gaagagacca gcttagccac agaagcaaac actgttcttt ctaaagtgtc cactggtgct 
35 22921 actactgagg tctccaggac agaagccatc tcctttagca gaacatccat gtcaggccct 

22981 gagcagtcca caatgtcaca agacatctcc ataggaacca tccccaggat ttctgcctcc 
23041 tctgtcctga cagaatctgc aaaaatgacc atcacaaccc aaacaggtcc ttcggagtct 
23101 acactagaaa gtacccttaa tttgaacaca gcaaccacac cctcttgggt ggaaacccac 
23161 tctatagtaa ttcagggatt tccacaccca gagatgacca cttccatggg cagaggtcct 
23221 ggaggtgtgt catggcctag ccctcccttt gtgaaagaaa ccagccctcc atcctccccg 
23281 ctgtctttac ctgccgtgac ctcacctcat cctgtttcca ccacattcct agcacatatc 
23341 cccccctctc cccttcctgt gacttcactt ctcacctctg gcccggcgac aaccacagat 
23401 atcttgggta caagcacaga acctggaacc agttcatctt caagtttgag caccacctcc 
23461 catgagagac tgaccactta caaagacact gcacatacag aagccgtgca tccttccaca 
45 23521 aacacaggag ggaccaatgt ggcaaccacc agctctggat ataaatcaca gtcctctgtc 

23581 ctagctgact catctccaat gtgtaccacc tccaccatgg gggatacaag tgttctcaca 
23641 tcaactcctg ccttccttga gactaggagg attcagacag agctagcttc ctccctgacc 
23701 cctggattga gggagtccag tggctctgaa gggaccagct caggcaccaa gatgagcact 
23761 gtcctctcta aagtgcccac tggtgctact actgagatct ccaaggaaga cgtcacctcc 
50 23821 atcccaggtc ccgctcaatc cacaatatca ccagacatct ccacaagaac cgtcagctgg 

23881 ttctctacat cccctgtcat gacagaatca gcagaaataa ccatgaacac ccatacaagt 
23941 cctttagggg ccacaacaca aggcaccagt actttggcca cgtcaagcac aacctctttg 
24001 acaatgacac actcaactat atctcaagga ttttcacact cacagatgag cactcttatg 
24061 aggaggggtc ctgaggatgt atcatggatg agccctcccc ttctggaaaa aactagacct 
55 24121 tccttttctc tgatgtcttc accagccaca acttcacctt ctcctgtttc ctccacatta 

24181 ccagagagca tctcttcctc tcctcttcct gtgacttcac tcctcacgtc tggcttggca 
24241 aaaactacag atatgttgca caaaagctca gaacctgtaa ccaactcacc tgcaaatttg 
24301 agcagcacct cagttgaaat actggccacc tctgaagtca ccacagatac agagaaaact 
24361 catccttctt caaacagaac agtgaccgat gtggggacct ccagttctgg acatgaatcc 
60 24421 acttcctttg tcctagctga ctcacagaca tccaaagtca catctccaat ggttattacc 

24481 tccaccatgg aggatacgag tgtctccaca tcaactcctg gcttttttga gactagcaga 
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24541 attcagacag aaccaacatc ctccctgacc cttggactga gaaagaccag cagctctgag 
24601 gggaccagct tagccacaga gatgagcact gtcctttctg gagtgcccac tggtgccact 
24661 gctgaagtct ccaggacaga agtcacctcc tctagcagaa catccatctc aggctttgct 
24721 cagctcacag tgtcaccaga gacttccaca gaaaccatca ccagactccc tacctccagc 
5 24781 ataatgacag aatcagcaga aatgatgatc aagacacaaa cagatcctcc tgggtctaca 

24841 ccagagagta ctcatactgt ggacatatca acaacaccca actgggtaga aacccactcg 
24901 actgtgactc agagattttc acactcagag atgaccactc ttgtgagcag aagccctggt 
24961 gatatgttat ggcctagtca atcctctgtg gaagaaacca gctctgcctc ttccctgctg 
25021 tctctgcctg ccacgacctc accttctcct gtttcctcta cattagtaga ggatttccct 
10 25081 tccgcttctc ttcctgtgac ttctcttctc acccctggcc tggtgataac cacagacagg 

25141 atgggcataa gcagagaacc tggaaccagt tccacttcaa atttgagcag cacctcccat 
25201 gagagactga ccactttgga agacactgta gatacagaag acatgcagcc ttccacacac 
25261 acagcagtga ccaacgtgag gacctccatt tctggacatg aatcacaatc ttctgtccta 
25321 tctgactcag agacacccaa agccacatct ccaatgggta ccacctacac catgggggaa 
15 25381 acgagtgttt ccatatccac ttctgacttc tttgagacca gcagaattca gatagaacca 

25441 acatcctccc tgacttctgg attgagggag accagcagct ctgagaggat cagctcagcc 
25501 acagagggaa gcactgtcct ttctgaagtg cccagtggtg ctaccactga ggtctccagg 
25561 acagaagtga tatcctctag gggaacatcc atgtcagggc ctgatcagtt caccatatca 
25621 ccagacatct ctactgaagc gatcaccagg ctttctactt cccccattat gacagaatca 
20 2 5681 gcagaaagtg ccatcactat tgagacaggt tctcctgggg ctacatcaga gggtaccctc 

25741 accttggaca cctcaacaac aaccttttgg tcagggaccc actcaactgc atctccagga 
25801 ttttcacact cagagatgac cactcttatg agtagaactc ctggagatgt gccatggccg 
25861 agccttccct ctgtggaaga agccagctct gtctcttcct cactgtcttc acctgccatg 
25921 acctcaactt cttttttctc cgcattacca gagagcatct cctcctctcc tcatcctgtg 
25981 actgcacttc tcacccttgg cccagtgaag accacagaca tgttgcgcac aagctcagaa 
2 6041 cctgaaacca gttcacctcc aaatttgagc agcacctcag ctgaaatatt agccacgtct 
26101 gaagtcacca aagatagaga gaaaattcat ccctcctcaa acacacctgt agtcaatgta 
26161 gggactgtga tttataaaca tctatcccct tcctctgttt tggctgactt agtgacaaca 
26221 aaacccacat ctccaatggc taccacctcc actctgggga atacaagtgt ttccacatca 
30 26281 actcctgcct tcccagaaac tatgatgaca cagccaactt cctccctgac ttctggatta 

26341 agggagatca gtacctctca agagaccagc tcagcaacag agagaagtgc ttctctttct 
26401 ggaatgccca ctggtgctac tactaaggtc tccagaacag aagccctctc cttaggcaga 
26461 acatccaccc caggtcctgc tcaatccaca atatcaccag aaatctccac ggaaaccatc 
2 6521 actagaattt ctactcccct caccacgaca ggatcagcag aaatgaccat cacccccaaa 
35 2 6581 acaggtcatt ctggggcatc ctcacaaggt acctttacct tggacacatc aagcagagcc 

2 6641 tcctggccag gaactcactc agctgcaact cacagatctc cacactcagg gatgaccact 
2 6701 cctatgagca gaggtcctga ggatgtgtca tggccaagcc gcccatcagt ggaaaaaact 
2 6761 agccctccat cttccctggt gtctttatct gcagtaacct caccttcgcc actttattcc 
26821 acaccatctg agagtagcca ctcatctcct ctccgggtga cttctctttt cacccctgtc 
40 26881 atgatgaaga ccacagacat gttggacaca agcttggaac ctgtgaccac ttcacctccc 

26941 agtatgaata tcacctcaga tgagagtctg gccacttcta aagccaccat ggagacagag 
27001 gcaattcagc tttcagaaaa cacagctgtg actcagatgg gcaccatcag cgctagacaa 
27061 gaattctatt cctcttatcc aggcctccca gagccatcca aagtgacatc tccagtggtc 
27121 acctcttcca ccataaaaga cattgtttct acaaccatac ctgcttcctc tgagataaca 
45 27181 agaattgaga tggagtcaac atccaccctg acccccacac caagggagac cagcacctcc 

27241 caggagatcc actcagccac aaagccaagc actgttcctt acaaggcact cactagtgcc 
27301 acgattgagg actccatgac acaagtcatg tcctctagca gaggacctag ccctgatcag 
27361 tccacaatgt cacaagacat atccagtgaa gtgatcacca ggctctctac ctcccccatc 
27421 aaggcagaat ctacagaaat gaccattacc acccaaacag gttctcctgg ggctacatca 
50 27481 aggggtaccc ttaccttgga cacttcaaca acttttatgt cagggaccca ctcaactgca 

27541 tctcaaggat tttcacactc acagatgacc gctcttatga gtagaactcc tggagatgtg 
27601 ccatggctaa gccatccctc tgtggaagaa gccagctctg cctctttctc actgtcttca 
27661 cctgtcatga cctcatcttc tcccgtttct tccacattac cagacagcat ccactcttct 
27721 tcgcttcctg tgacatcact tctcacctca gggctggtga agaccacaga gctgttgggc 
55 27781 acaagctcag aacctgaaac cagttcaccc ccaaatttga gcagcacctc agctgaaata 

27841 ctggccacca ctgaagtcac tacagataca gagaaactgg agatgaccaa tgtggtaacc 
27901 tcaggttata cacatgaatc tccttcctct gtcctagctg actcagtgac aacaaaggcc 
27961 acatcttcaa tgggtatcac ctaccccaca ggagatacaa atgttctcac atcaacccct 
28021 gccttctctg acaccagtag gattcaaaca aagtcaaagc tctcactgac tcctgggttg 
60 28081 atggagacca gcatctctga agagaccagc tctgccacag aaaaaagcac tgtcctttct 

28141 agtgtgccca ctggtgctac tactgaggtc tccaggacag aagccatctc ttctagcaga 
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28201 acatccatcc caggccctgc tcaatccaca atgtcatcag acacctccat ggaaaccatc 
28261 actagaattt ctacccccct cacaaggaaa gaatcaacag acatggccat cacccccaaa 
28321 acaggtcctt ctggggctac ctcgcagggt acctttacct tggactcatc aagcacagcc 
28381 tcctggccag gaactcactc agctacaact cagagatttc cacagtcagt ggtgacaact 
5 28441 cctatgagca gaggtcctga ggatgtgtca tggccaagcc cgctgtctgt ggaaaaaaac 

28501 agccctccat cttccctggt atcttcatct tcagtaacct caccttcgcc actttattcc 
28561 acaccatctg ggagtagcca ctcctctcct gtccctgtca cttctctttt cacctctatc 
28621 atgatgaagg ccacagacat gttggatgca agtttggaac ctgagaccac ttcagctccc 
28681 aatatgaata tcacctcaga tgagagtctg gccacttcta aagccaccac ggagacagag 
10 28741 gcaattcacg tttttgaaaa tacagcagcg tcccatgtgg aaaccaccag tgctacagag 

2 8801 gaactctatt cctcttcccc aggcttctca gagccaacaa aagtgatatc tccagtggtc 
28861 acctcttcct ctataagaga caacatggtt tccacaacaa tgcctggctc ctctggcatt 
28921 acaaggattg agatagagtc aatgtcatct ctgacccctg gactgaggga gaccagaacc 
28981 tcccaggaca tcacctcatc cacagagaca agcactgtcc tttacaagat gtcctctggt 

15 29041 gccactcctg aggtctccag gacagaagtt atgccctcta gcagaacatc cattcctggc 

29101 cctgctcagt ccacaatgtc actagacatc tccgatgaag ttgtcaccag gctgtctacc 
29161 tctcccatca tgacagaatc tgcagaaata accatcacca cccaaacagg ttattctctg 
29221 gctacatccc aggttaccct tcccttgggc acctcaatga cctttttgtc agggacccac 
29281 tcaactatgt ctcaaggact ttcacactca gagatgacca atcttatgag caggggtcct 

20 29341 gaaagtctgt catggacgag ccctcgcttt gtggaaacaa ctagatcttc ctcttctctg 

29401 acatcattac ctctcacgac ctcactttct cctgtgtcct ccacattact agacagtagc 
29461 ccctcctctc ctcttcctgt gacttcactt atcctcccag gcctggtgaa gactacagaa 
29521 gtgttggata caagctcaga gcctaaaacc agttcatctc caaatttgag cagcacctca 
29581 gttgaaatac cggccacctc tgaaatcatg acagatacag agaaaattca tccttcctca 

25 29641 aacacagcgg tggccaaagt gaggacctcc agttctgttc atgaatctca ttcctctgtc 

29701 ctagctgact cagaaacaac cataaccata ccttcaatgg gtatcacctc cgctgtggac 
29761 gataccactg ttttcacatc aaatcctgcc ttctctgaga ctaggaggat tccgacagag 
29821 ccaacattct cattgactcc tggattcagg gagactagca cctctgaaga gaccacctca 
29881 atcacagaaa caagtgcagt cctttatgga gtgcccacta gtgctactac tgaagtctcc 

30 29941 atgacagaaa tcatgtcctc taatagaaca cacatccctg actctgatca gtccacgatg 

30001 tctccagaca tcatcactga agtgatcacc aggctctctt cctcatccat gatgtcagaa 
30061 tcaacacaaa tgaccatcac cacccaaaaa agttctcctg gggctacagc acagagtact 
30121 cttaccttgg ccacaacaac agcccccttg gcaaggaccc actcaactgt tcctcctaga 
30181 tttttacact cagagatgac aactcttatg agtaggagtc ctgaaaatcc atcatggaag 

35 30241 agctctccct ttgtggaaaa aactagctct tcatcttctc tgttgtcctt acctgtcacg 

303 01 acctcacctt ctgtttcttc cacattaccg cagagtatcc cttcctcctc tttttctgtg 
30361 acttcactcc tcaccccagg catggtgaag actacagaca caagcacaga acctggaacc 
30421 agtttatctc caaatctgag tggcacctca gttgaaatac tggctgcctc tgaagtcacc 
30481 acagatacag agaaaattca tccttcttca agcatggcag tgaccaatgt gggaaccacc 

40 30541 agttctggac atgaactata ttcctctgtt tcaatccact cggagccatc caaggctaca 

30601 tacccagtgg gtactccctc ttccatggct gaaacctcta tttccacatc aatgcctgct 

3 0661 aattttgaga ccacaggatt tgaggctgag ccattttctc atttgacttc tggatttagg 
30721 aagacaaaca tgtccctgga caccagctca gtcacaccaa caaatacacc ttcttctcct 
30781 gggtccactc accttttaca gagttccaag actgatttca cctcttctgc aaaaacatca 

45 30841 tccccagact ggcctccagc ctcacagtat actgaaattc cagtggacat aatcaccccc 

30901 tttaatgctt ctccatctat tacggagtcc actgggataa cctccttccc agaatccagg 
30961 tttactatgt ctgtaacaga aagtactcat catctgagta cagatttgct gccttcagct 
31021 gagactattt ccactggcac agtgatgcct tctctatcag aggccatgac ttcatttgcc 
31081 accactggag ttccacgagc catctcaggt tcaggtagtc cattctctag gacagagtca 

50 31141 ggccctgggg atgctactct gtccaccatt gcagagagcc tgccttcatc cactcctgtg 

31201 ccattctcct cttcaacctt cactaccact gattcttcaa ccatcccagc cctccatgag 
31261 ataacttcct cttcagctac cccatataga gtggacacca gtcttgggac agagagcagc 
3132 i actactgaag gacgcttggt tatggtcagt actttggaca cttcaagcca accaggcagg 
31381 acatcttcaa cacccatttt ggataccaga atgacagaga gcgttgagct gggaacagtg 

55 31441 acaagtgctt atcaagttcc ttcactctca acacggttga caagaactga tggcattatg 

31501 gaacacatca caaaaatacc caatgaagca gcacacagag gtaccataag accagtcaaa 
31561 ggccctcaga catccacttc gcctgccagt cctaaaggac tacacacagg agggacaaaa 
31621 agaatggaga ccaccaccac agctttgaag accaccacca cagctttgaa gaccacttcc 
31681 agagccacct tgaccaccag tgtctatact cccactttgg gaacactgac tcccctcaat 

60 31741 gcatcaaggc aaatggccag cacaatcctc acagaaatga tgatcacaac cccatatgtt 

31801 ttccctgatg ttccagaaac gacatcctca ttggctacca gcctgggagc agaaaccagc 
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agtcatgggg 
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actaagtccc 
agctcaacta 
tcaatagcca 
cctggtgcag 
actattccaa 
catcctgaag 
cggttggtga 
ctgacaaact 
accagcccaa 
ccttcaatga 
tcaactgagg 
acaactattc 
accagtcatg 
ccaggagtgg 
attctgactt 
acagaagctg 
ctggttgcta 
ggtgaaccag 
gttccaactg 
ggagtaaaca 
ccttcaatgg 
tcacctgggg 
acaactattc 
accagtcatg 
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atgatttcag 
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caaccagaga 
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gcagagagta 
actatgcctt 
atttcacctg 
agtgcaactt 
gttactcatc 
agtgaaccag 
tttccaacaa 
tctgggacag 
accacaacct 
cctgtctccc 
agcactacaa 
cagctcattc 
catagtaagt 
tcagctgttt 
cctagttctg 
ccagagacta 
acaattccca 
ggagtagaca 
gtgacctcac 
ccttctcctg 
ggcttcactg 
gtcactcatc 
agtagtccag 
gtactgacaa 
ggggcagcaa 



ccaggacaac 
gttctggggc 
atacaacagc 
cccccaattt 
cagacgtcag 
cactggtcac 
cacatgaaac 
ttcccagaac 
ccagtcctgg 
aagatctggt 
ctttgactct 
cacagacaag 
cctcaatggt 
cccctggtga 
cagttccctg 
ccaccagtca 
taccaggagt 
caattctgac 
gggaagaagc 
tgacctctct 
tttctcttgg 
gctcagctgt 
gttctagggc 
agaccacacc 
tttcacctga 
gtacaagtat 
ccaccagtca 
tatcaggagt 
caattctaac 
gggtagaagc 
ctctggtcac 
ctgatgaacc 
ccattccaac 
gttctgggtc 
ccatagactc 
ctgtctccac 
gctcaactct 
ctacagtcac 
gtataccagg 
ttccaacagt 
ctgcagtcac 
acaccacacc 
taactgtctc 
acaccagtat 
catttatcac 
ctggtgcatc 
ctttcccaac 
atcctgcaga 
cagacaccac 
caacgacaac 
ggacagacac 
cagtcacgtg 
acttttccca 
cgaggtcagg 
aggtcactag 
gtgaaccaga 
ttccaattcg 
ctccacagac 
atgccacacc 
caatctcacc 
ccagtacaac 



cccatctgtt 
agagagaagt 
ttcatgggtt 
tttccacagt 
ctcagccatt 
tatttcgggg 
agagacaaga 
aatccccaat 
ggcagaaacc 
gacctcacag 
ttctcctggt 
ttcggccatt 
caccagtttg 
accagctaca 
gacaacttcc 
tggggcagaa 
agtgacccct 
tctttctcct 
cagttctgct 
ggtcactagt 
tgaaccagag 
tccaactgtt 
agtaaccagt 
ttcaatggcc 
ggtaccagga 
tccaactctg 
tggggcagaa 
ggtgacccct 
tctttcttct 
cagctcagct 
tagttctaga 
agagaccaca 
tttagctgtc 
agagaccagt 
atgggtcgct 
tggtgagccg 
tcccaggaca 
cagtcct^ag 
tgtgctgaca 
gcctgagtcc 
cagcacaaca 
atcaatagcc 
acctgatgta 
aactattcca 
ctattctgag 
aaagatgctg 
actgacggag 
gaccaacaca 
actcccagta 
tatctcacct 
cagtacaacc 
gctcactcat 
taggggatca 
tgttccaact 
ttctgcaaca 
gaccacagcc 
gactgttccc 
cagcacacct 
tgtaatggcc 
tggtgcacca 
tgttccaact 



ctcaatagag 
ccggttattc 
atccatcctg 
gaattagaca 
ccaacaaata 
acagatacta 
accacatggc 
ttttctcatc 
agttcagcta 
gtcactagtt 
gaaccaaaga 
ccaacttcaa 
gcggcaaaga 
acagtttcat 
atttttttcc 
tccagttcag 
ttggtcacca 
ggtgaaccag 
attccaactc 
tctagggcag 
accacacctt 
ttacctgagg 
acaactcttc 
accagtcatg 
gtggtgacct 
attctttctc 
gccagctcag 
ctggtcacta 
agtgagccag 
gttctaactg 
gcagtaacca 
acttcattgg 
tcccctactg 
gcgttttcaa 
catcctggga 
tttacaaata 
acctcaaggt 
gcagaatcca 
tcactggtca 
ccacatgaat 
gttcccagga 
accagtcctg 
ccagatatgg 
actctgactc 
acacacacaa 
acctcactgg 
accccatatg 
atggttccca 
gccatcacca 
gatatgtcag 
ttcccaacat 
cctgcagaaa 
gacactgcac 
acaaccatcc 
gacactagta 
tcatcagcta 
tctagtgagc 
gtttccagaa 
accagtccta 
gagatggtga 
ttgactcatt 



aatcagagac 
aaactctaga 
cagagaccat 
ctgtatcttc 
tctcacctag 
gtacaacatt 
tcactcatcc 
atgaatcaga 
ttccaattat 
ctgggacaga 
cgatagcctc 
ctatctcgcc 
caagtacaac 
tggtcacgca 
atagtaaatc 
ctgttccaac 
gttctagggc 
agaccacacc 
caactgtttc 
tgactagtac 
caatggccac 
taccaggaat 
caactctgac 
gggcagaagc 
ctctggtcac 
ctggtgaact 
ctgttccaac 
gttccagggc 
agaccacacc 
tttcacctga 
gtacaactat 
tcacccattc 
tacaagggct 
atctaactgt 
cagaagcaag 
tctcattggt 
tttcccacag 
gctcagccat 
ctagctctgg 
cagaggcaac 
caacccctaa 
gggcagaagc 
taacctcaca 
tttcttctgg 
gttcagccat 
tcatcagttc 
aaccagagac 
agacaactcc 
gtcctgggcc 
atctggtgac 
tgagtgagac 
ccagcacaac 
cctcaatggt 
cacccagtat 
cagctattcc 
cccatcctgg 
cagatacaat 
caacctccag 
ggacagaagc 
cttcacagat 
ctcctggtat 



cacagcctca 
tgtttcttct 
cccaactgtt 
cacagccacc 
tgaactagat 
cccaacactg 
tgcagagacc 
tgccacacct 
gactgtctca 
cagaaatatg 
attagtcacc 
tgctgtatca 
taatcgagct 
tcctgcacag 
agacaccaca 
tccaactgtt 
agtgatcagt 
ttcaatggcc 
acctggggta 
aactattcca 
cagtcatggg 
ggtgacctct 
tctttctcct 
cagctcaact 
tagttctagt 
agaaaccaca 
tccaactgtt 
agtgaccagt 
ttcaatggcc 
ggtaccagga 
tccaactctg 
tgaggcaaag 
ggtgacttca 
tgcctcaagt 
ttctgttgtt 
cacccatcct 
tgaattagac 
ttcaactact 
gagagacatc 
agcctcatgg 
ttattctcat 
cacttcagat 
ggtcactagt 
tgagccagag 
tccaactctc 
tgggacagac 
aacagccata 
caagttttcc 
agaagccagt 
ctcactggtc 
cccatatgaa 
ggtttctggg 
caccagtcct 
accaggggta 
aactttgact 
gacacagact 
ggcttcctgg 
tttttcccat 
cagttcagct 
cactagttct 
gccagagacc 
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35521 acagccttat tgagcaccca tcccagaaca gggacaagta aaacatttcc tgcttcaact 
35581 gtgtttcctc aagtatcaga gaccacagcc tcactcacca ttagacctgg tgcagagact 
3 5641 agcacagctc tcccaactca gacaacatcc tctctcttca ccctacttgt aactggaacc 
35701 agcagagttg atctaagtcc aactgcttca cctggtgttt ctgcaaaaac agccccactt 
5 35761 tccacccatc cagggacaga gaccagcaca atgattccaa cttcaactct ttcccttggt 

35821 ttactagaga ctacaggctt actggccacc agctcttcag cagagaccag cacgagtact 
3 5881 ctaactctga ctgtttcccc tgctgtctct gggctttcca gtgcctctat aacaactgat 
35941 aagccccaaa ctgtgacctc ctggaacaca gaaacctcac catctgtaac ttcagttgga 
3 6001 cccccagaat tttccaggac tgtcacaggc accactatga ccttgatacc atcagagatg 

10 36061 ccaacaccac ctaaaaccag tcatggagaa ggagtgagtc caaccactat cttgagaact 

36121 acaatggttg aagccactaa tttagctacc acaggttcca gtcccactgt ggccaagaca 
36181 acaaccacct tcaatacact ggctggaagc ctctttactc ctctgaccac acctgggatg 
36241 tccaccttgg cctctgagag tgtgacctca agaacaagtt ataaccatcg gtcctggatc 
3 6301 tccaccacca gcagttataa ccgtcggtac tggacccctg ccaccagcac tccagtgact 

15 36361 tctacattct ccccagggat ttccacatcc tccatcccca gctccacagc agccacagtc 

36421 ccattcatgg tgccattcac cctcaacttc accatcacca acctgcagta cgaggaggac 
3 6481 atgcggcacc ctggttccag gaagttcaac gccacagaga gagaactgca gggtctgctc 
36541 aaacccttgt tcaggaatag cagtctggaa tacctctatt caggctgcag actagcctca 
3 6601 ctcaggccag agaaggatag ctcagccatg gcagtggatg ccatctgcac acatcgccct 

20 36661 gaccctgaag acctcggact ggacagagag cgactgtact gggagctgag caatctgaca 

36721 aatggcatcc aggagctggg cccctacacc ctggaccgga acagtctcta tgtcaatggt 
36781 ttcacccatc gaagctctat gcccaccacc agcactcctg ggacctccac agtggatgtg 
36841 ggaacctcag ggactccatc ctccagcccc agccccacgg ctgctggccc tctcctgatg 
36901 ccgttcaccc tcaacttcac catcaccaac ctgcagtacg aggaggacat gcgtcgcact 

25 36961 ggctccagga agttcaacac catggagagt gtcctgcagg gtctgctcaa gcccttgttc 

37021 aagaacacca gtgttggccc tctgtactct ggctgcagat tgaccttgct caggcccgag 
3 7081 aaagatgggg cagccactgg agtggatgcc atctgcaccc accgccttga ccccaaaagc 
37141 cctggactca acagggagca gctgtactgg gagctaagca aactgaccaa tgacattgaa 
37201 gagctgggcc cctacaccct ggacaggaac agtctctatg tcaatggttt cacccatcag 

30 37261 agctctgtgt ccaccaccag cactcctggg acctccacag tggatctcag aacctcaggg 

37321 actccatcct ccctctccag ccccacaatt atggctgctg gccctctcct ggtaccattc 
37381 accctcaact tcaccatcac caacctgcag tatggggagg acatgggtca ccctggctcc 
37441 aggaagttca acaccacaga gagggtcctg cagggtctgc ttggtcccat attcaagaac 
37501 accagtgttg gccctctgta ctctggctgc agactgacct ctctcaggtc tgagaaggat 

35 37561 ggagcagcca ctggagtgga tgccatctgc atccatcatc ttgaccccaa aagccctgga 

37621 ctcaacagag agcggctgta ctgggagctg agccaactga ccaatggcat caaagagctg 
37681 ggcccctaca ccctggacag gaacagtctc tatgtcaatg gtttcaccca tcggacctct 
37741 gtgcccacca ccagcactcc tgggacctcc acagtggacc ttggaacctc agggactcca 
37801 ttctccctcc caagccccgc aactgctggc cctctcctgg tgctgttcac cctcaacttc 

40 37861 accatcacca acctgaagta tgaggaggac atgcatcgcc ctggctccag gaagttcaac 

37921 accactgaga gggtcctgca gactctgctt ggtcctatgt tcaagaacac cagtgttggc 
37981 cttctgtact ctggctgcag actgaccttg ctcaggtccg agaaggatgg agcagccact 
38 041 ggagtggatg ccatctgcac ccaccgtctt gaccccaaaa gccctggact ggacagagag 
38101 cagctatact gggagctgag ccagctgacc aatggcatca aagagctggg cccctacacc 

45 38161 ctggacagga acagtctcta tgtcaatggt ttcacccatt ggatccctgt gcccaccagc 

38221 agcactcctg ggacctccac agtggacctt gggtcaggga ctccatcctc cctccccagc 
38281 cccacagctg ctggccctct cctggtgcca ttcaccctca acttcaccat caccaacctg 
38341 cagtacgagg aggacatgca tcacccaggc tccaggaagt tcaacaccac ggagcgggtc 
38401 ctgcagggtc tgcttggtcc catgttcaag aacaccagtg tcggccttct gtactctggc 

50 38461 tgcagactga ccttgctcag gtccgagaag gatggagcag ccactggagt ggatgccatc 

38521 tgcacccacc gtcttgaccc caaaagccct ggagtggaca gggagcagct atactgggag 
38581 ctgagccagc tgaccaatgg catcaaagag ctgggtccct acaccctgga cagaaacagt 
38641 ctctatgtca atggtttcac ccatcagacc tctgcgccca acaccagcac tcctgggacc 
38701 tccacagtgg accttgggac ctcagggact ccatcctccc tccccagccc tacatcngct 

55 38761 ggccctctcc tggtnccntt caccctcaac ttcaccatca ccaacctgca gtacgaggag 

38821 gacatgcggc acccnggntc caggaagttc aacaccacng agagggtnct gcagggtctg 
38881 ctnaagcccc tnttcaagag caccagtgtt ggccctctgt actctggctg cagactgacc 
38941 ttgctcaggt ccgagaagga tggagcagcc actggagtgg atgccatctg cacccaccgt 
39001 cttgacccca aaagccctgg agtggacagg gagcagctat actgggagct gagccagctg 

60 39061 accaatggca tcaaagagct gggtccctac accctggaca gaaacagtct ctatgtcaat 

39121 ggtttcaccc atcagacctc tgcgcccaac accagcactc ctgggacctc cacagtggac 
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39181 cttgggacct cagggactcc atcctccctc cccagcccta catctgctgg ccctctcctg 
39241 gtgccattca ccctcaactt caccatcacc aacctgcagt acgaggagga catgcatcac 
39301 ccaggctcca ggaagttcaa caccacggag cgggtcctgc agggtctgct tggtcccatg 
39361 ttcaagaaca ccagtgtcgg ccttctgtac tctggctgca gactgacctt gctcaggcct 
5 39421 gagaagaatg gggcagccac tggaatggat gccatctgca gccaccgtct tgaccccaaa 

39481 agccctggac tcaacagaga gcagctgtac tgggagctga gccagctgac ccatggcatc 
39541 aaagagctgg gcccctacac cctggacagg aacagtctct atgtcaatgg tttcacccat 
39601 cggagctctg tggcccccac cagcactcct gggacctcca cagtggacct tgggacctca 
39661 gggactccat cctccctccc cagccccaca acagctgttc ctctcctggt gccgttcacc 
10 39721 ctcaacttta ccatcaccaa tctgcagtat ggggaggaca tgcgtcaccc tggctccagg 

39781 aagttcaaca ccacagagag ggtcctgcag ggtctgcttg gtcccttgtt caagaactcc 
39841 agtgtcggcc ctctgtactc tggctgcaga ctgatctctc tcaggtctga gaaggatggg 
39901 gcagccactg gagtggatgc catctgcacc caccacctta accctcaaag ccctggactg 
39961 gacagggagc agctgtactg gcagctgagc cagatgacca atggcatcaa agagctgggc 
15 40021 ccctacaccc tggaccggaa cagtctctac gtcaatggtt tcacccatcg gagctctggg 

40081 ctcaccacca gcactccttg gacttccaca gttgaccttg gaacctcagg gactccatcc 
4 0141 cccgtcccca gccccacaac tgctggccct ctcctggtgc cattcaccct caacttcacc 
40201 atcaccaacc tgcagtatga ggaggacatg catcgccctg gatctaggaa gttcaacacc 
40261 acagagaggg tcctgcaggg tctgcttagt cccattttca agaactccag tgttggccct 
20 40321 ctgtactctg gctgcagact gacctctctc aggcccgaga aggatggggc agcaactgga 

40381 atggatgctg tctgcctcta ccaccctaat cccaaaagac ctggactgga cagagagcag 
40441 ctgtactggg agctaagcca gctgacccac aacatcactg agctgggccc ctacagcctg 
40501 gacagggaca gtctctatgt caatggtttc acccatcaga actctgtgcc caccaccagt 
40561 actcctggga cctccacagt gtactgggca accactggga ctccatcctc cttccccggc 
25 40621 cacacagagc ctggccctct cctgatacca ttcactttca actttaccat caccaacctg 

40681 cattatgagg aaaacatgca acaccctggt tccaggaagt tcaacaccac ggagagggtt 
40741 ctgcagggtc tgctcaagcc cttgttcaag aacaccagtg ttggccctct gtactctggc 
40801 tgcagactga cctctctcag gcccgagaag gatggggcag caactggaat ggatgctgtc 
40861 tgcctctacc accctaatcc caaaagacct gggctggaca gagagcagct gtactgggag 
30 40921 ctaagccagc tgacccacaa catcactgag ctgggcccct acagcctgga cagggacagt 

40981 ctctatgtca atggtttcac ccatcagaac tctgtgccca ccaccagtac tcctgggacc 
41041 tccacagtgt actgggcaac cactgggact ccatcctcct tccccggcca cacagagcct 
41101 ggccctctcc tgataccatt cactttcaac tttaccatca ccaacctgca ttatgaggaa 
41161 aacatgcaac accctggttc caggaagttc aacaccacgg agagggttct gcagggtctg 
35 41221 ctcaagccct tgttcaagaa caccagtgtt ggccctctgt actctggctg cagactgacc 

41281 ttgctcagac ctgagaagca tgaggcagcc actggagtgg acaccatctg tacccaccgc 
41341 gttgatccca tcggacctgg actggacagg gagcggctat actgggagct gagccagctg 
41401 accaacagca ttaccgaact gggaccctac accctggaca gggacagtct ctatgtcaat 
41461 ggcttcaacc ctcggagctc tgtgccaacc accagcactc ctgggacctc cacagtgcac 
40 41521 ctggcaacct ctgggactcc atcctccctg cctggccaca cagcccctgt ccctctcttg 

41581 ataccattca ccctcaactt taccatcacc aacctgcatt atgaggaaaa catgcaacac 
41641 cctggttcca ggaagttcaa caccacggag agggttctgc agggtctgct caagcccttg 
41701 ttcaagaaca ccagtgttgg ccctctgtac tctggctgca gactgacctt gctcagacct 
41761 gagaagcatg aggcagccac tggagtggac accatctgta cccaccgcgt tgatcccatc 
41821 ggacctggac tgnacagnga gcngctntac tgggagctna gccanctgac caannncatc 
41881 nnngagctgg gnccctacac cctggacagg nacagtctct atgtcaatgg tttcacccat 
41941 cnganctctg ngcccaccac cagcactcct gggacctcca cagtgnacnt nggnacctcn 
42001 gggactccat cctccntccc cngccncaca tctgctggcc ctctcctggt gccattcacc 
42061 ctcaacttca ccatcaccaa cctgcagtac gaggaggaca tgcatcaccc aggctccagg 
50 42121 aagttcaaca ccacggagcg ggtcctgcag ggtctgcttg gtcccatgtt caagaacacc 

42181 agtgtcggcc ttctgtactc tggctgcaga ctgaccttgc tcaggcctga gaagaatggg 
42241 gcagccactg gaatggatgc catctgcagc caccgtcttg accccaaaag ccctggactc 
42301 gacagagagc agctgtactg ggagctgagc cagctgaccc atggcatcaa agagctgggc 
42361 ccctacaccc tggacaggaa cagtctctat gtcaatggtt tcacccatcg gagctctgtg 
55 42421 gcccccacca gcactcctgg gacctccaca gtggaccttg ggacctcagg gactccatcc 

42481 tccctcccca gccccacaac agctgttcct ctcctggtgc cgttcaccct caactttacc 
42541 atcaccaatc tgcagtatgg ggaggacatg cgtcaccctg gctccaggaa gttcaacacc 
42601 acagagaggg tcctgcaggg tctgcttggt cccttgttca agaactccag tgtcggccct 
42661 ctgtactctg gctgcagact gatctctctc aggtctgaga aggatggggc agccactgga 
60 42721 gtggatgcca tctgcaccca ccaccttaac cctcaaagcc ctggactgga cagggagcag 

42781 ctgtactggc agctgagcca gatgaccaat ggcatcaaag agctgggccc ctacaccctg 
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42841 gaccggaaca gtctctacgt caatggtttc acccatcgga gctctgggct caccaccagc 
42901 actccttgga cttccacagt tgaccttgga acctcaggga ctccatcccc cgtccccagc 
42961 cccacaactg ctggccctct cctggtgcca ttcaccctaa acttcaccat caccaacctg 
43021 cagtatgagg aggacatgca tcgccctgga tctaggaagt tcaacgccac agagagggtc 
5 43081 ctgcagggtc tgcttagtcc catattcaag aactccagtg ttggccctct gtactctggc 

43141 tgcagactga cctctctcag gcccgagaag gatggggcag caactggaat ggatgctgtc 
43201 tgcctctacc accctaatcc caaaagacct ggactggaca gagagcagct gtactgggag 
43261 ctaagccagc tgacccacaa catcactgag ctgggcccct acagcctgga cagggacagt 
43321 ctctatgtca atggtttcac ccatcagagc tctatgacga ccaccagaac tcctgatacc 
10 43381 tccacaatgc acctggcaac ctcgagaact ccagcctccc tgtctggacc tacgaccgcc 

43441 agccctctcc tggtgctatt cacaatcaac tgcaccatca ccaacctgca gtacgaggag 
43501 gacatgcgtc gcactggctc caggaagttc aacaccatgg agagtgtcct gcagggtctg 
43561 ctcaagccct tgttcaagaa caccagtgtt ggccctctgt actctggctg cagattgacc 
43621 ttgctcaggc ccaagaaaga tggggcagcc actggagtgg atgccatctg cacccaccgc 
43681 cttgacccca aaagccctgg actcaacagg gagcagctgt actgggagct aagcaaactg 
43741 accaatgaca ttgaagagct gggcccctac accctggaca ggaacagtct ctatgtcaat 
43801 ggtttcaccc atcagagctc tgtgtccacc accagcactc ctgggacctc cacagtggat 
43861 ctcagaacct cagggactcc atcctccctc tccagcccca caattatgnc nnctgnccct 
43921 ctcctgntnc cnttcaccnt caacttnacc atcaccaacc tgcantangn ggaixnacatg 
43981 cnncncccng gntccaggaa gttcaacacc acngagaggg tcctacaggg tctgctcagg 
44041 cccttgttca agaacaccag tgtcagctct ctgtactctg gttgcagact gaccttgctc 
44101 aggcctgaga aggatggggc agccaccaga gtggatgctg cctgcaccta ccgccctgat 
44161 cccaaaagcc ctggactgga cagagagcaa ctatactggg agctgagcca gctaacccac 
44221 agcatcactg agctgggacc ctacaccctg gacagggtca gtctctatgt caatggcttc 
25 44281 aaccctcgga gctctgtgcc aaccaccagc actcctggga cctccacagt gcacctggca 

44341 acctctggga ctccatcctc cctgcctggc cacacancnn ctgnccctct cctgntnccn 
44401 ttcaccntca acttnaccat caccaacctg cantangngg annacatgcn ncncccnggn 
44461 tccaggaagt tcaacaccac ngagagggtt ctgcagggtc tgctcaaacc cttgttcagg 
44521 aatagcagtc tggaatacct ctattcaggc tgcagactag cctcactcag gccagagaag 
44581 gatagctcag ccatggcagt ggatgccatc tgcacacatc gccctgaccc tgaagacctc 
44641 ggactggaca gagagcgact gtactgggag ctgagcaatc tgacaaatgg catccaggag 
44701 ctgggcccct acaccctgga ccggaacagt ctctacgtca atggtttcac ccatcggagc 
44761 tctgggctca ccaccagcac tccttggact tccacagttg accttggaac ctcagggact 
44821 ccatcccccg tccccagccc cacaactgct ggccctctcc tggtgccatt caccctcaac 
35 44881 ttcaccatca ccaacctgca gtatgaggag gacatgcatc gccctggttc caggaggttc 

44941 aacaccacgg agagggttct gcagggtctg ctcacgccct tgttcaagaa caccagtgtt 
45001 ggccctctgt actctggctg cagactgacc ttgctcagac ctgagaagca agaggcagcc 
45061 actggagtgg acaccatctg tacccaccgc gttgatccca tcggacctgg actggacaga 
45121 gagcggctat actgggagct gagccagctg accaacagca tcacagagct gggaccctac 
40 45181 accctggata gggacagtct ctatgtcaat ggcttcaacc cttggagctc tgtgccaacc 

45241 accagcactc ctgggacctc cacagtgcac ctggcaacct ctgggactcc atcctccctg 
4 5301 cctggccaca cagcccctgt ccctctcttg ataccattca ccctcaactt taccatcacc 
4 5361 gacctgcatt atgaagaaaa catgcaacac cctggttcca ggaagttcaa caccacggag 
45421 agggttctgc agggtctgct caagcccttg ttcaagagca ccagcgttgg ccctctgtac 
45 45481 tctggctgca gactgacctt gctcagacct gagaaacatg gggcagccac tggagtggac 

45541 gccatctgca ccctccgcct tgatcccact ggtcctggac tggacagaga gcggctatac 
45601 tgggagctga gccagctgac caacagcgtt acagagctgg gcccctacac cctggacagg 
45661 gacagtctct atgtcaatgg cttcacccat cggagctctg tgccaaccac cagtattcct 
45721 gggacctctg cagtgcacct ggaaacctct gggactccag cctccctccc tggccacaca 
50 45781 gcccctggcc ctctcctggt gccattcacc ctcaacttca ctatcaccaa cctgcagtat 

45841 gaggaggaca tgcgtcaccc tggttccagg aagttcagca ccacggagag agtcctgcag 
45901 ggtctgctca agcccttgtt caagaacacc agtgtcagct ctctgtactc tggttgcaga 
45961 ctgaccttgc tcaggcctga gaaggatggg gcagccacca gagtggatgc tgtctgcacc 
4 6021 catcgtcctg accccaaaag ccctggactg gacagagagc ggctgtactg gaagctgagc 
55 46081 cagctgaccc acggcatcac tgagctgggc ccctacaccc tggacaggca cagtctctat 

46141 gtcaatggtt tcacccatca gagctctatg acgaccacca gaactcctga tacctccaca 
46201 atgcacctgg caacctcgag aactccagcc tccctgtctg gacctacgac cgccagccct 
46261 ctcctggtgc tattcacaat taacttcacc atcactaacc tgcggtatga ggagaacatg 
46321 catcaccctg gctctagaaa gtttaacacc acggagagag tccttcaggg tctgctcagg 
60 46381 cctgtgttca agaacaccag tgttggccct ctgtactctg gctgcagact gaccacgctc 

46441 aggcccaaga aggatggggc agccaccaaa gtggatgcca tctgcaccta ccgccctgat 
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46501 cccaaaagcc ctggactgga cagagagcag ctatactggg agctgagcca gctaacccac 

46561 agcatcactg agctgggccc ctacacccag gacagggaca gtctctatgt caatggcttc 

46621 acccatcgga gctctgtgcc aaccaccagt attcctggga cctctgcagt gcacctggaa 

46681 acctctggga ctccagcctc cctccctggc cacacagccc ctggccctct cctggtgcca 

5 46741 ttcaccctca acttcactat caccaacctg cagtatgagg aggacatgcg tcaccctggt 

46801 tccaggaagt tcaacaccac ggagagagtc ctgcagggtc tgctcaagcc cttgttcaag 

46861 agcaccagtg ttggccctct gtactctggc tgcagactga ccttgctcag gcctgaaaaa 

46921 cgtggggcag ccaccggcgt ggacaccatc tgcactcacc gccttgaccc tctaaaccca 

469B1 ggactggaca gagagcagct atactgggag ctgagcaaac tgacccgtgg catcatcgag 

10 47041 ctgggcccct acctcctgga cagaggcagt ctctatgtca atggtttcac ccatcggacc 

47101 tctgtgccca ccaccagcac tcctgggacc tccacagtgg accttggaac ctcagggact 

47161 ccattctccc tcccaagccc cgcancnnct gnccctctcc tgntnccntt caccntcaac 

47221 ttnaccatca ccaacctgca ntangnggan nacatgcnnc ncccnggntc caggaagttc 

47281 aacaccacng agagggtcct gcagactctg cttggtccta tgttcaagaa caccagtgtt 

15 47341 ggccttctgt actctggctg cagactgacc ttgctcaggt ccgagaagga tggagcagcc 

47401 actggagtgg atgccatctg cacccaccgt cttgacccca aaagccctgg agtggacagg 

47461 gagcaactat actgggagct gagccagctg accaatggca ttaaagaact gggcccctac 

47521 accctggaca ggaacagtct ctatgtcaat gggttcaccc attggatccc tgtgcccacc 

47581 agcagcactc ctgggacctc cacagtggac cttgggtcag ggactccatc ctccctcccc 

20 47641 agccccacaa ctgctggccc tctcctggtg ccgttcaccc tcaacttcac catcaccaac 

47701 ctgaagtacg aggaggacat gcattgccct ggctccagga agttcaacac cacagagaga 

47761 gtcctgcaga gtctgcttgg tcccatgttc aagaacacca gtgttggccc tctgtactct 

47821 ggctgcagac tgaccttgct caggtccgag aaggatggag cagccactgg agtggatgcc 

47881 atctgcaccc accgtcttga ccccaaaagc cctggagtgg acagggagca gctatactgg 

25 47941 gagctgagcc agctgaccaa tggcatcaaa gagctgggtc cctacaccct ggacagaaac 

4 8001 agtctctatg tcaatggttt cacccatcag acctctgcgc ccaacaccag cactcctggg 

48061 acctccacag tggaccttgg gacctcaggg actccatcct ccctccccag ccctacancn 

48121 nctgnccctc tcctgntncc nttcaccntc aacttnacca tcaccaacct gcantangng 

48181 gannacatgc nncncccngg ntccaggaag ttcaacacca cngagngngt nctgcagggt 

30 48241 ctgctnnnnc ccntnttcaa gaacnccagt gtnggccntc tgtactctgg ctgcagactg 

48301 acc time tea ggncngagaa gnatggngca gecactggan tggatgecat ctgcanccac 

48361 enncntnanc ccaaaagncc tggactgnac agngagenge tntactggga getnagecan 

48421 ctgaccaann neatennnga gctgggnccc tacaccctgg acaggnacag tctctatgtc 

48481 aatggtttca cccattggat ccctgtgccc accagcagca ctcctgggac ctccacagtg 

35 4 8541 gaccttgggt cagggactcc atcctccctc cccagcccca caactgetgg ccctctcctg 

4 8601 gtgccgttca ccctcaactt caccatcacc aacctgaagt acgaggagga catgeattge 

4 8661 cctggctcca ggaagttcaa caccacagag agagtcctgc agagtctget tggtcccatg 

4 8721 ttcaagaaca ccagtgttgg ccctctgtac tetggctgea gactgacctc gctcaggtcc 

4 8781 gagaaggatg gagcagccac tggagtggat gccatctgca cccaccgtgt tgaccccaaa 

40 48841 agecctggag tggacaggga gcagctatac tgggagctga gccagctgac caatggcatc 

4 8901 aaagagctgg gtccctacac cctggacaga aacagtctct atgtcaatgg tttcacccat 

4 8961 cagacctctg cgcccaacac cagcactcct gggacctcca cagtgnacnt nggnacctcn 

49021 gggactccat cctccntccc cngccncaca tctgctggcc ctctcctggt gccattcacc 

4 9081 ctcaacttca ccatcaccaa cctgcagtac gaggaggaca tgcatcaccc aggctccagg 

45 49141 aagttcaaca ccacggagcg ggtcctgcag ggtctgcttg gtcccatgtt caagaacacc 

49201 agtgtcggcc ttctgtactc tggctgeaga ctgaccttgc teaggectga gaagaatggg 

49261 gcaaccactg gaatggatgc catctgcacc caccgtcttg accccaaaag ccctggactg 

49321 nacagngagc ngctntactg ggagctnagc canctgacca annncatenn ngagctgggn 

49381 ccctacaccc tggacaggna cagtctctat gtcaatggtt tcacccatcn ganctctgng 

50 49441 cccaccacca gcactcctgg gacctccaca gtgnacntng gnacctcngg gactccatcc 

49501 tccntccccn gccncacanc nnctgnccct ctcctgntnc cnttcaccnt caacttnacc 

49561 atcaccaacc tgcantangn ggannacatg cnncncccng gntccaggaa gttcaacacc 

4 9621 acngagaggg ttctgcaggg tetgetcaaa cccttgttca ggaatagcag tctggaatac 

49681 ctctattcag getgeagact agcctcactc aggecagaga aggatagctc agecatggea 

55 4 9741 gtggatgcca tctgcacaca tcgccctgac cctgaagacc tcggactgga cagagagega 

49801 ctgtactggg agctgagcaa tctgaeaaat ggcatccagg agctgggccc ctacaccctg 

49861 gaeeggaaca gtctctatgt caatggtttc acccatcgaa gctctatgcc caccaccagc 

49921 actcctggga cctccacagt ggatgtggga acctcaggga ctccatcctc cagccccagc 

49981 cccacgactg ctggccctct cctgatacca ttcaccctca acttcaccat caccaacctg 

60 50041 cagtatgggg aggacatggg tcaccctggc tccaggaagt tcaacaccac agagagggtc 

50101 ctgcagggtc tgcttggtcc catattcaag aacaccagtg ttggccctct gtactctggc 
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50161 tgcagactga cctctctcag gtctgagaag gatggagcag ccactggagt ggatgccatc 
50221 tgcatccatc atcttgaccc caaaagccct ggactcaaca gagagcggct gtactgggag 
50281 ctgagccaac tgaccaatgg catcaaagag ctgggcccct acaccctgga caggaacagt 
50341 ctctatgtca atggtttcac ccatcggacc tctgtgccca ccaccagcac tcctgggacc 
5 50401 tccacagtgg accttggaac ctcagggact ccattctccc tcccaagccc cgcaactgct 

50461 ggccctctcc tggtgctgtt caccctcaac ttcaccatca ccaacctgaa gtatgaggag 
50521 gacatgcatc gccctggctc caggaagttc aacaccactg agagggtcct gcagactctg 
50581 cttggtccta tgttcaagaa caccagtgtt ggccttctgt actctggctg cagactgacc 
50641 ttgctcaggt ccgagaagga tggagcagcc actggagtgg atgccatctg cacccaccgt 

10 50701 cttgacccca aaagccctgg actgnacagn gagcngctnt actgggagct aagccanctg 

50761 accaannnca tcnnngagct gggnccctac accctggaca ggnacagtct ctatgtcaat 
50821 ggtttcaccc atcnganctc tgngcccacc accagcactc ctgggacctc cacagtgnac 
50881 ntnggnacct cngggactcc atcctccntc cccngccnca cancnnctgn ccctctcctg 
50941 ntnccnttca ccntcaactt naccatcacc aacctgcant angngganna catgcnncnc 

15 51001 ccnggntcca ggaagttcaa caccacngag agagtccttc agggtctgct caggcctgtg 

51061 ttcaagaaca ccagtgttgg ccctctgtac tctggctgca gactgacctt gctcaggccc 
51121 aagaaggatg gggcagccac caaagtggat gccatctgca cctaccgccc tgatcccaaa 
51181 agccctggac tggacagaga gcagctatac tgggagctga gccagctaac ccacagcatc 
51241 actgagctgg gcccctacac ccaggacagg gacagtctct atgtcaatgg cttcacccat 

20 51301 cggagctctg tgccaaacac cagtattcct gggacctctg cagtgcacct ggaaaccact 

51361 gggactccat cctccttccc cggccacaca gagcctggcc ctctcctgat accattcact 
51421 ttcaacttta ccatcaccaa cctgcgttat gaggaaaaca tgcaacaccc tggttccagg 
51481 aagttcaaca ccacggagag ggttctgcag ggtctgctca cgcccttgtt caagaacacc 
51541 agtgttggcc ctctgtactc tggctgcaga ctgaccttgc tcagacctga gaagcaggag 

25 51601 gcagccactg gagtggacac catctgtacc caccgcgttg atcccatcgg acctggactg 

51661 gacagagagc ggctatactg ggagctgagc cagctgacca acagcatcac agagctggga 
51721 ccctacaccc tggataggga cagtctctat gtcgatggct tcaacccttg gagctctgtg 
51781 ccaaccacca gcactcctgg gacctccaca gtgcacctgg caacctctgg gactccatcc 
51841 cccctgcctg gccacacagc ccctgtccct ctcttgatac cattcaccct caactttacc 

30 51901 atcaccgacc tgcattatga agaaaacatg caacaccctg gttccaggaa gttcaacacc 

51961 acggagaggg ttctgcaggg tctgctcaag cccttgttca agagcaccag cgttggccct 
52021 ctgtactctg gctgcagact gaccttgctc agacctgaga aacatggggc agccactgga 
52081 gtggacgcca tctgcaccct ccgccttgat cccactggtc ctggactgga cagagagcgg 
52141 ctatactggg agctgagcca gctgaccaac agcatcacag agctgggacc ctacaccctg 

35 522 01 gatagggaca gtctctatgt caatggcttc aacccttgga gctctgtgcc aaccaccagc 

522 61 actcctggga cctccacagt gcacctggca acctctggga ctccatcctc cctgcctggc 
52321 cacacaactg ctggccctct cctggtgccg ttcaccctca acttcaccat caccaacctg 
52381 aagtacgagg aggacatgca ttgccctggc tccaggaagt tcaacaccac agagagagtc 
52441 ctgcagagtc tgcatggtcc catgttcaag aacaccagtg ttggccctct gtactctggc 

40 52501 tgcagactga ccttgctcag gtccgagaag gatggagcag ccactggagt ggatgccatc 

52561 tgcacccacc gtcttgaccc caaaagccct ggactgnaca gngagcngct ntactgggag 
52621 ctnagccanc tgaccaannn catcnnngag ctgggnccct acaccctgga caggnacagt 
52681 ctctatgtca atggtttcac ccatcnganc tctgngccca ccaccagcac tcctgggacc 
52741 tccacagtgn acntnggnac ctcngggact ccatcctccn tccccngccn cacancnnct 

45 52801 gnccctctcc tgntnccntt caccntcaac ttnaccatca ccaacctgca ntangnggan 

52861 nacatgcnnc ncccnggntc caggaagttc aacaccacng agngngtnct gcagggtctg 
52921 ctnnnncccn tnttcaagaa cnccagtgtn ggccntctgt actctggctg cagactgacc 
52981 tnnctcaggn cngagaagna tggngcagcc actggantgg atgccatctg canccaccnn 
53041 cntnanccca aaagncctgg actgnacagn gagcngctnt actgggagct nagccanctg 

50 53101 accaacagca tcacagagct gggaccctac accctggata gggacagtct ctatgtcaat 

53161 ggtttcaccc atcgaagctc tatgcccacc accagtattc ctgggacctc tgcagtgcac 
53221 ctggaaacct ctgggactcc agcctccctc cctggccaca cagcccctgg ccctctcctg 
53281 gtgccattca ccctcaactt cactatcacc aacctgcagt atgaggagga catgcgtcac 
53341 cctggttcca ggaagttcaa caccacggag agagtcctgc agggtctgct caagcccttg 

55 53401 ttcaagagca ccagtgttgg ccctctgtac tctggctgca gactgacctt gctcaggcct 

53461 gaaaaacgtg gggcagccac cggcgtggac accatctgca ctcaccgcct tgaccctcta 
53521 aaccctggac tgnacagnga gcngctntac tgggagctna gccanctgac caannncatc 
53581 nnngagctgg gnccctacac cctggacagg nacagtctct atgtcaatgg tttcacccat 
53641 cnganctctg ngcccaccac cagcactcct gggacctcca cagtgnacnt nggnacctcn 

60 53701 gggactccat cctccntccc cngccncaca ncnnctgncc ctctcctgnt nccnttcacc 

53761 ntcaacttna ccatcaccaa cctgcantan gnggannaca tgcnncnccc nggntccagg 
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53821 aagttcaaca ccacngagng ngtnctgcag ggtctgctnn nncccntntt caagaacncc 
53B81 agtgtnggcc ntctgtactc tggctgcaga ctgacctnnc tcaggncnga gaagnatggn 
53941 gcagccactg gantggatgc catctgcanc caccnncntn ancccaaaag ncctggactg 
54001 nacagngagc ngctntactg ggagctnagc canctgacca annncatcnn ngagctgggn 
5 54061 ccctacaccc tggacaggna cagtctctat gtcaatggtt ttcaccctcg gagctctgtg 

54121 ccaaccacca gcactcctgg gacctccaca gtgcacctgg caacctctgg gactccatcc 
54181 tccctgcctg gccacacagc ccctgtccct ctcttgatac cattcaccct caactttacc 
54241 atcaccaacc tgcattatga agaaaacatg caacaccctg gttccaggaa gttcaacacc 
54301 acggagcggg tcctgcaggg tctgcttggt cccatgttca agaacacaag tgtcggcctt 

10 54361 ctgtactctg gctgcagact gaccttgctc aggcctgaga agaatggggc agccactgga 

54421 atggatgcca tctgcagcca ccgtcttgac cccaaaagcc ctggactgna cagngagcng 
54481 ctntactggg agctnagcca nctgaccaan nncatcnnng agctgggncc ctacaccctg 
54541 gacaggnaca gtctctatgt caatggtttc acccatcnga nctctgngcc caccaccagc 
54601 actcctggga cctccacagt gnacntnggn acctcnggga ctccatcctc cntccccngc 

15 54661 cncacancnn ctgnccctct cctgntnccn ttcaccntca acttnaccat caccaacctg 

54721 cantangngg annacatgcn ncncccnggn tccaggaagt tcaacaccac ngagngngtn 
54781 ctgcagggtc tgctnnnncc cntnttcaag aacnccagtg tnggccntct gtactctggc 
54841 tgcagactga cctnnctcag gncngagaag natggngcag ccactggant ggatgccatc 
54901 tgcanccacc nncntnancc caaaagncct ggactgnaca gngagcngct ntactgggag 

20 54961 ctnagccanc tgaccaannn catcnnngag ctgggnccct acaccctgga caggnacagt 

55021 ctctatgtca atggtttcac ccatcagaac tctgtgccca ccaccagtac tcctgggacc 
55081 tccacagtgt actgggcaac cactgggact ccatcctcct tccccggcca cacagagcct 
55141 ggccctctcc tgataccatt cactttcaac tttaccatca ccaacctgca ttatgaggaa 
55201 aacatgcaac accctggttc caggaagttc aacaccacgg agagggttct gcagggtctg 

25 55261 ctcacgccct tgttcaagaa caccagtgtt ggccctctgt actctggctg cagactgacc 

55321 ttgctcagac ctgagaagca ggaggcagcc actggagtgg acaccatctg tacccaccgc 
55381 gttgatccca tcggacctgg actgnacagn gagcngctnt actgggagct nagccanctg 
55441 accaannnca tcnnngagct gggnccctac accctggaca ggnacagtct ctatgtcaat 
55501 ggtttcaccc atcnganctc tgngcccacc accagcactc ctgggacctc cacagtgnac 

30 55561 ntnggnacct cngggactcc atcctccntc cccngccnca cancnnctgn ccctctcctg 

55621 ntnccnttca ccntcaactt naccatcacc aacctgcant angngganna catgcnncnc 
55681 ccnggntcca ggaagttcaa caccacngag ngngtnctgc agggtctgct nnnncccntn 
55741 ttcaagaacn ccagtgtngg ccntctgtac tctggctgca gactgacctn nctcaggncn 
55801 gagaagnatg gngcagccac tggantggat gccatctgca nccaccnncn tnancccaaa 

35 55861 agncctggac tgnacagnga gcngctntac tgggagctna gccanctgac caannncatc 

55921 nnngagctgg gnccctacac cctggacagg nacagtctct atgtcaatgg tttcacccat 
55981 cggagctctg tgccaaccac cagcagtcct gggacctcca cagtgcacct ggcaacctct 
56041 gggactccat cctccctgcc tggccacaca gcccctgtcc ctctcttgat accattcacc 
56101 ctcaacttta ccatcaccaa cctgcattat gaagaaaaca tgcaacaccc tggttccagg 

40 56161 aagttcaaca ccacggagag ggttctgcag ggtctgctca agcccttgtt caagagcacc 

56221 agtgttggcc ctctgtactc tggctgcaga ctgaccttgc tcagacctga gaaacatggg 
56281 gcagccactg gagtggacgc catctgcacc ctccgccttg atcccactgg tcctggactg 
56341 nacagngagc ngctntactg ggagctnagc canctgacca annncatcnn ngagctgggn 
56401 ccctacaccc tggacaggna cagtctctat gtcaatggtt tcacccatcn ganctctgng 

45 56461 cccaccacca gcactcctgg gacctccaca gtgnacntng gnacctcngg gactccatcc 

56521 tccntccccn gccncacanc nnctgnccct ctcctgntnc cnttcaccnt caacttnacc 
56581 atcaccaacc tgcantangn ggannacatg cnncncccng gntccaggaa gttcaacacc 
56641 acngagngng tnctgcaggg tctgctnnnn cccntnttca agaacnccag tgtnggccnt 
56701 ctgtactctg gctgcagact gacctnnctc aggncngaga agnatggngc agccactgga 

50 56761 ntggatgcca tctgcancca ccnncntnan cccaaaagnc ctggactgna cagngagcng 

56821 ctntactggg agctnagcca nctgaccaan nncatcnnng agctgggncc ctacaccctg 
56881 gacaggnaca gtctctatgt caatggtttc acccatcgga cctctgtgcc caccaccagc 
56941 actcctggga cctccacagt gcacctggca acctctggga ctccatcctc cctgcctggc 
57001 cacacagccc ctgtccctct cttgatacca ttcaccctca actttaccat caccaacctg 

55 57061 cagtatgagg aggacatgca tcgccctgga tctaggaagt tcaacaccac agagagggtc 

57121 ctgcagggtc tgcttagtcc cattttcaag aactccagtg ttggccctct gtactctggc 
57181 tgcagactga cctctctcag gcccgagaag gatggggcag caactggaat ggatgctgtc 
57241 tgcctctacc accctaatcc caaaagacct gggctggaca gagagcagct gtactgcgag 
57301 ctaagccagc tgacccacaa catcactgag ctgggcccct acagcctgga cagggacagt 

60 57361 ctctatgtca atggtttcac ccatcagaac tctgtgccca ccaccagtac tcctgggacc 

57421 tccacagtgt actgggcaac cactgggact ccatcctcct tccccggcca cacancnnct 
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57481 gnccctctcc tgntnccntt caccntcaac ttnaccatca ccaacctgca ntangnggan 
57541 nacatgcnnc ncccnggntc caggaagttc aacaccacng agngngtnct gcagggtctg 
57601 ctnnnncccn tnttcaagaa cnccagtgtn ggccntctgt actctggctg cagactgacc 
57661 tnnctcaggn cngagaagna tggngcagcc actggantgg atgccatctg canccaccnn 
5 57721 cntnanccca aaagncctgg actgnacagn gagcngctnt actgggagct nagccanctg 

57781 accaannnca tcnnngagct gggnccctac accctggaca ggnacagtct ctatgtcaat 
57841 ggtttcaccc attggagctc tgggctcacc accagcactc cttggacttc cacagttgac 
57901 cttggaacct cagggactcc atcccccgtc cccagcccca caactgctgg ccctctcctg 
57961 gtgccattca ccctaaactt caccatcacc aacctgcagt atgaggagga catgcatcgc 
10 58021 cctggatcta ggaagttcaa cgccacagag agggtcctgc agggtctgct tagtcccata 

58081 ttcaagaaca ccagtgttgg ccctctgtac tctggctgca gactgacctt gctcagacct 
58141 gagaagcagg aggcagccac tggagtggac accatctgta cccaccgcgt tgatcccatc 
58201 ggacctggac tgnacagnga gcngctntac tgggagctna gccanctgac caannncatc 
58261 nnngagctgg gnccctacac cctggacagg nacagtctct atgtcaatgg tttcacccat 
15 58321 cnganctctg ngcccaccac cagcactcct gggacctcca cagtgnacnt nggnacctcn 

58381 gggactccat cctccntccc cngccncaca ncnnctgncc ctctcctgnt nccnttcacc 
58441 ntcaacttna ccatcaccaa cctgcantan gnggannaca tgcnncnccc nggntccagg 
58501 aagttcaaca ccacngagng ngtnctgcag ggtctgctnn nncccntntt caagaacncc 
58561 agtgtnggcc ntctgtactc tggctgcaga ctgacctnnc tcaggncnga gaagnatggn 
58621 gcagccactg gantggatgc catctgcanc caccnncntn ancccaaaag ncctggactg 
58681 nacagngagc ngctntactg ggagctnagc canctgacca annncatcnn ngagctgggn 
58741 ccctacaccc tggacaggna cagtctctat gtcaatggtt tcacccatcg gagctttggg 
58801 ctcaccacca gcactccttg gacttccaca gttgaccttg gaacctcagg gactccatcc 
58861 cccgtcccca gccccacaac tgctggccct ctcctggtgc cattcaccct aaacttcacc 
25 58921 atcaccaacc tgcagtatga ggaggacatg catcgccctg gctccaggaa gttcaacacc 

58981 acggagaggg tccttcaggg tctgcttacg. cccttgttca ggaacaccag tgtcagctct 
59041 ctgtactctg gttgcagact gaccttgctc aggcctgaga aggatggggc agccaccaga 
59101 gtggatgctg tctgcaccca tcgtcctgac cccaaaagcc ctggactgna cagngagcng 
59161 ctntactggg agctnagcca nctgaccaan nncatcnnng agctgggncc ctacaccctg 
59221 gacaggnaca gtctctatgt caatggtttc acccatcnga nctctgngcc caccaccagc 
59281 actcctggga cctccacagt gnacntnggn acctcnggga ctccatcctc cntccccngc 
59341 cncacancnn ctgnccctct cctgntnccn ttcaccntca acttnaccat caccaacctg 
59401 cantangngg annacatgcn ncncccnggn tccaggaagt tcaacaccac ngagngngtn 
59461 ctgcagggtc tgctnunncc cntnttcaag aacnccagtg tnggccntct gtactctggc 
35 59521 tgcagactga cctnnctcag gncngagaag natggngcag ccactggant ggatgccatc 

59581 tgcanccacc nncntnancc caaaagncct ggactgnaca gngagcngct ntactgggag 
59641 ctnagccanc tgaccaannn catcnnngag ctgggnccct acaccctgga caggnacagt 
59701 ctctatgtca atggtttcac ccattggatc cctgtgccca ccagcagcac tcctgggacc 
59761 tccacagtgg accttgggtc agggactcca tcctccctcc ccagccccac aactgctggc 
40 59821 cctctcctgg taccattcac cctcaacttc accatcacca acctgcagta tggggaggac 

59881 atgggtcacc ctggctccag gaagttcaac accacagaga gggtcctgca gggtctgctt 
59941 ggtcccatat tcaagaacac cagtgttggc cctctgtact ctggctgcag actgacctct 
60001 ctcaggtccg agaaggatgg agcagccact ggagtggatg ccatctgcat ccatcatctt 
60061 gaccccaaaa gccctggact gnacagngag cngctntact gggagctnag ccanctgacc 
60121 aannncatcn nngagctggg nccctacacc ctggacaggn acagtctcta tgtcaatggt 
60181 ttcacccatc nganctctgn gcccaccacc agcactcctg ggacctccac agtgnacntn 
60241 ggnacctcng ggactccatc ctccntcccc ngccncacan cnnctgnccc tctcctgntn 
60301 ccnttcaccn tcaacttnac catcaccaac ctgcantang nggannacat gcnncncccn 
60361 ggntccagga agttcaacac cacngagngn gtnctgcagg gtctgctnnn ncccntnttc 
60421 aagaacncca gtgtnggccn tctgtactct ggctgcagac tgacctnnct caggncngag 
60481 aagnatggng cagccactgg antggatgcc atctgcancc accnncntna ncccaaaagn 
60541 cctggactgn acagngagcn gctntactgg gagctnagcc anctgaccaa nnncatcnnn 
60601 gagctgggnc cctacaccct ggacaggnac agtctctatg tcaatggttt cacccatcag 
60661 acctttgcgc ccaacaccag cactcctggg acctccacag tggaccttgg gacctcaggg 
55 60721 actccatcct ccctccccag ccctacatct gctggccctc tcctggtgcc attcaccctc 

60781 aacttcacca tcaccaacct gcagtacgag gaggacatgc atcacccagg ctccaggaag 
60841 ttcaacacca cggagcgggt cctgcagggt ctgcttggtc ccatgttcaa gaacaccagt 
60901 gtcggccttc tgtactctgg ctgcagactg accttgctca ggcctgagaa gaatggggca 
60961 gccaccagag tggatgctgt ctgcacccat cgtcctgacc ccaaaagccc tggactgnac 
60 61021 agngagcngc tntactggga gctnagccan ctgaccaann ncatcnnnga gctgggnccc 

61081 tacaccctgg acaggnacag tctctatgtc aatggtttca cccatcngan ctctgngccc 
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61141 accaccagca ctcctgggac ctccacagtg nacntnggna cctcngggac tccatcctcc 
61201 ntccccngcc ncacagcccc tgtccctctc ttgataccat tcaccctcaa ctttaccatc 
61261 accaacctgc attatgaaga aaacatgcaa caccctggtt ccaggaagtt caacaccacg 
61321 gagagggttc tgcagggtct gctcaagccc ttgttcaaga gcaccagcgt tggccctctg 
5 61381 tactctggct gcagactgac cttgctcaga cctgagaaac atggggcagc cactggagtg 

61441 gacgccatct gcaccctccg ccttgatccc actggtcctg gactggacag agagcggcta 
61501 tactgggagc tgagccagct gaccaacagc gttacagagc tgggccccta caccctggac 
61561 agggacagtc tctatgtcaa tggcttcacc cagcggagct ctgtgccaac caccagtatt 
61621 cctgggacct ctgcagtgca cctggaaacc tctgggactc cagcctccct ccctggccac 

10 61681 acagcccctg gccctctcct ggtgccattc accctcaact tcactatcac caacctgcag 

61741 tatgaggtgg acatgcgtca ccctggttcc aggaagttca acaccacgga gagagtcctg 
61801 cagggtctgc tcaagccctt gttcaagagc accagtgttg gccctctgta ctctggctgc 
61861 agactgacct tgctcaggcc tgaaaaacgt ggggcagcca ccggcgtgga caccatctgc 
61921 actcaccgcc ttgaccctct aaaccctgga ctggacagag agcagctata ctgggagctg 

15 61981 agcaaactga cccgtggcat catcgagctg ggcccctacc tcctggacag aggcagtctc 

62041 tatgtcaatg gtttcaccca tcggaacttt gtgcccatca ccagcactcc tgggacctcc 
62101 acagtacacc taggaacctc tgaaactcca tcctccctac ctagacccat agtgcctggc 
62161 cctctcctgg tgccattcac cctcaacttc accatcacca acttgcagta tgaggaggcc 
62221 atgcgacacc ctggctccag gaagttcaat accacggaga gggtcctaca gggtctgctc 

20 62281 aggcccttgt tcaagaatac cagtatcggc cctctgtact ccagctgcag actgaccttg 

62341 ctcaggccag agaaggacaa ggcagccacc agagtggatg ccatctgtac ccaccaccct 
62401 gaccctcaaa gccctggact gaacagagag cagctgtact gggagctgag ccagctgacc 
62461 cacggcatca ctgagctggg cccctacacc ctggacaggg acagtctcta tgtcgatggt 
62521 ttcactcatt ggagccccat accgaccacc agcactcctg ggacctccat agtgaacctg 

25 62581 ggaacctctg ggatcccacc ttccctccct gaaactacan rmnctgnccc tctcctgntn 

62641 ccnttcaccn tcaacttnac catcaccaac ctgcantang nggannacat gcnncncccn 
62701 ggntccagga agttcaacac cacngagagg gttctgcagg gtctgctcaa gcccttgttc 
62761 aagagcacca gtgttggccc tctgtattct ggctgcagac tgaccttgct caggcctgag 
62821 aaggacggag tagccaccag agtggacgcc atctgcaccc accgccctga ccccaaaatc 

30 62881 cctgggctag acagacagca gctatactgg gagctgagcc agctgaccca cagcatcact 

62941 gagctgggac cctacaccct ggatagggac agtctctatg tcaatggttt cacccagcgg 
63001 agctctgtgc ccaccaccag cactcctggg actttcacag tacagccgga aacctctgag 
63061 actccatcat ccctccctgg ccccacagcc actggccctg tcctgctgcc attcaccctc 
63121 aattttacca tcactaacct gcagtatgag gaggacatgc atcgccctgg ctccaggaag 

35 63181 ttcaacacca cggagagggt ccttcagggt ctgcttatgc ccttgttcaa gaacaccagt 

63241 gtcagctctc tgtactctgg ttgcagactg accttgctca ggcctgagaa ggatggggca 
63301 gccaccagag tggatgctgt ctgcacccat cgtcctgacc ccaaaagccc tggactggac 
63361 agagagcggc tgtactggaa gctgagccag ctgacccacg gcatcactga gctgggcccc 
63421 tacaccctgg acaggcacag tctctatgtc aatggtttca cccatcagag ctctatgacg 

40 63481 accaccagaa ctcctgatac ctccacaatg cacctggcaa cctcgagaac tccagcctcc 

63541 ctgtctggac ctacgaccgc cagccctctc ctggtgctat tcacaattaa cttcaccatc 
63601 actaacctgc ggtatgagga gaacatgcat caccctggct ctagaaagtt taacaccacg 
63661 gagagagtcc ttcagggtct gctcaggcct gtgttcaaga acaccagtgt tggccctctg 
63721 tactctggct gcagactgac cttgctcagg cccaagaagg atggggcagc caccaaagtg 

45 63781 gatgccatct gcacctaccg ccctgatccc aaaagccctg gactggacag agagcagcta 

63841 tactgggagc tgagccagct aacccacagc atcactgagc tgggccccta caccctggac 
63901 agggacagtc tctatgtcaa tggtttcaca cagcggagct ctgtgcccac cactagcatt 
63961 cctgggaccc ccacagtgga cctgggaaca tctgggactc cagtttctaa acctggtccc 
64021 tcggctgcca gccctctcct ggtgctattc actctcaact tcaccatcac caacctgcgg 

50 64081 tatgaggaga acatgcagca ccctggctcc aggaagttca acaccacgga gagggtcctt 

64141 cagggcctgc tcaggtccct gttcaagagc accagtgttg gccctctgta ctctggctgc 
64201 agactgactt tgctcaggcc tgaaaaggat gggacagcca ctggagtgga tgccatctgc 
64261 acccaccacc ctgaccccaa aagccctagg ctggacagag agcagctgta ttgggagctg 
64321 agccagctga cccacaatat cactgagctg ggccactatg ccctggacaa cgacagcctc 

55 64381 tttgtcaatg gtttcactca tcggagctct gtgtccacca ccagcactcc tgggaccccc 

64441 acagtgtatc tgggagcatc taagactcca gcctcgatat ttggcccttc agctgccagc 
64501 catctcctga tactattcac cctcaacttc accatcacta acctgcggta tgaggagaac 
64561 atgtggcctg gctccaggaa gttcaacact acagagaggg tccttcaggg cctgctaagg 
64621 cccttgttca agaacaccag tgttggccct ctgtactctg gctccaggct gaccttgctc 

60 64681 aggccagaga aagatgggga agccaccgga gtggatgcca tctgcaccca ccgccctgac 

64741 cccacaggcc ctgggctgga cagagagcag ctgtatttgg agctgagcca gctgacccac 
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64801 agcatcactg agctgggccc ctacacactg gacagggaca gtctctatgt caatggtttc 
64861 acccatcgga gctctgtacc caccaccagc accggggtgg tcagcgagga gccattcaca 
64921 ctgaacttca ccatcaacaa cctgcgctac atggcggaca tgggccaacc cggctccctc 
64981 aagttcaaca tcacagacaa cgtcatgaag cacctgctca gtcctttgtt ccagaggagc 
5 65041 agcctgggtg cacggtacac aggctgcagg gtcatcgcac taaggtctgt gaagaacggt 

65101 gctgagacac gggtggacct cctctgcacc tacctgcagc ccctcagcgg cccaggtctg 
65161 cctatcaagc aggtgttcca tgagctgagc cagcagaccc atggcatcac ccggctgggc 
65221 ccctactctc tggacaaaga cagcctctac cttaacggtt acaatgaacc tggtctagat 
65281 gagcctccta caactcccaa gccagccacc acattcctgc ctcctctgtc agaagccaca 

10 65341 acagccatgg ggtaccacct gaagaccctc acactcaact tcaccatctc caatctccag 

65401 tattcaccag atatgggcaa gggctcagct acattcaact ccaccgaggg ggtccttcag 
65461 cacctgctca gacccttgtt ccagaagagc agcatgggcc ccttctactt gggttgccaa 
65521 ctgatctccc tcaggcctga gaaggatggg gcagccactg gtgtggacac cacctgcacc 
65581 taccaccctg accctgtggg ccccgggctg gacatacagc agctttactg ggagctgagt 

15 65641 cagctgaccc atggtgtcac ccaactgggc ttctatgtcc tggacaggga tagcctcttc 

65701 atcaatggct atgcacccca gaatttatca atccggggcg agtaccagat aaatttccac 
65761 attgtcaact ggaacctcag taatccagac cccacatcct cagagtacat caccctgctg 
65821 agggacatcc aggacaaggt caccacactc tacaaaggca gtcaactaca tgacacattc 
65881 cgcttctgcc tggtcaccaa cttgacgatg gactccgtgt tggtcactgt caaggcattg 

20 65941 ttctcctcca atttggaccc cagcctggtg gagcaagtct ttctagataa gaccctgaat 

66001 gcctcattcc attggctggg ctccacctac cagttggtgg acatccatgt gacagaaatg 
66061 gagtcatcag tttatcaacc aacaagcagc tccagcaccc agcacttcta cctgaatttc 
66121 accatcacca acctaccata ttcccaggac aaagcccagc caggcaccac caattaccag 
66181 aggaacaaaa ggaatattga ggatgcgctc aaccaactct tccgaaacag cagcatcaag 

25 66241 agttattttt ctgactgtca agtttcaaca ttcaggtctg tccccaacag gcaccacacc 

66301 ggggtggact ccctgtgtaa cttctcgcca ctggctcgga gagtagacag agttgccatc 
66361 tatgaggaat ttctgcggat gacccggaat ggtacccagc tgcagaactt caccctggac 
66421 aggagcagtg tccttgtgga tgggtattct cccaacagaa atgagccctt aactgggaat 
66481 tctgaccttc ccttctgggc tgtcatcctc atcggcttgg caggactcct gggactcatc 

30 66541 acatgcctga tctgcggtgt cctggtgacc acccgccggc ggaagaagga aggagaatac 

66601 aacgtccagc aacagtgccc aggctactac cagtcacacc tagacctgga ggatctgcaa 
66661 tgactggaac ttgccggtgc ctggggtgcc tttcccccag ccagggtcca aagaagcttg 
66721 gctggggcag aaataaacca tattggtcgg aaaaaaaaaa aaaaa 

35 SEQ ID NO. 3 
hk5 amino acid 

MATARPPWMWVLCALITALLLGVTEHVLANNDVSCDHPSNTVPSGSNQDLGAGAGEDARSDDSSSRIINGSD 
CDMHTQPWQAALLLRPNQLYCGAVLVHPQWLLTAAHCRKKVFRVRLGHYSLSPVYESGQQMFQGVKSIPHPG 
40 YSHPGHSNDLMLIKLNRRIRPTKDVRPINVSSHCPSAGTKCLVSGWGTTKSPQVHFPKVLQCLNISVLSQKR 
CEDAYPRQIDDTMFCAGDKAGRDSCQGDSGGPWCNGSLQGLVSWGDYPCARPNRPGVYTNLCKFTKWIQET 
IQANS 



45 SEQ ID NO. 4 
KLK5 CDS 

ggtgtctgtg cgtcctgcac 

tgctagactc ctatcttctg 

gcccgtcctt gtggttcctc 

50 aagacccccc tggatgtggg 

agagcatgtt ctcgccaaca 

tgggagcaac caggacctgg 

cagccgcatc atcaatggat 

gttgctaagg cccaaccagc 

55 cacggccgcc cactgcagga 

accagtttat gaatctgggc 

ctactcccac cctggccact 

tcccactaaa gatgtcagac 



ccacatcttt ctctgtcccc 
aattctatag tgcctgggtc 
tctacttggg gaaatcaggt 
tgctctgtgc tctgatcaca 
atgatgtttc ctgtgaccac 
gagctggggc cggggaagac 
ccgactgcga tatgcacacc 
tctactgcgg ggcggtgttg 
agaaagtttt cagagtccgt 
agcagatgtt ccagggggtc 
ctaacgacct catgctcatc 
ccatcaacgt ctcctctcat 



tccttgccct gtctggaggc 
tcagcgcagt gccgatggtg 
gcagcggcca tggctacagc 
gccttgcttc tgggggtcac 
ccctctaaca ccgtgccctc 
gcccggtcgg atgacagcag 
cagccgtggc aggccgcgct 
gtgcatccac agtggctgct 
ctcggccact actccctgtc 
aaatccatcc cccaccctgg 
aaactgaaca gaagaattcg 
tgtccctctg ctgggacaaa 
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gtgcttggtg tctggctggg. ggacaaccaa 

ccagtgcttg aatatcagcg tgctaagtca 

gatagatgac accatgttct gcgccggtga 

ttctgggggg cctgtggtct gcaatggctc 

cccttgtgcc cggcccaaca gaccgggtgt 

gatccaggaa accatccagg ccaactcctg 

cccacctgct gcagggacag ccctgacact 

gttgagaatg ttcatctctc cagcccctga 

cccccacatt gggctgaccg tgtctctcta 

tccagggcgg gggttgcgtc tcaatctccc 

catcccttct ctgcagctct gacccaaatt 
aaaaaaa 



gagcccccaa gtgcacttcc ctaaggtcct 
gaaaaggtgc gaggatgctt acccgagaca 
caaagcaggt agagactcct gccagggtga 
cctgcaggga ctcgtgtcct ggggagatta 
ctacacgaac ctctgcaagt tcaccaagtg 
agtcatccca ggactcagca caccggcatc 
cctttcagac cctcattcct tcccagagat 
ccccatgtct cctggactca gggtctgctt 
gttgaaccct gggaacaatt tccaaaactg 
tggggcactt tcatcctcaa gctcagggcc 
tagtcccaga aataaactga gaagtggaaa 



SEQ ID NO. 5 
KLK5 nucleic acid 

gggcccagag tgaaggcaag agaaggagtt 
tcccctgcct aaaatgcagg gagagggagg 
aagaaagaga gagagagaga gagacagaat 
acagagagcc tgggacacag ggacacacag 
acacaaatgg agacacagag gtgtaaagaa 
aaaggggcag aagcacagtt ttcagggtgg 
tttttttttt tttttgagac ggagtctcgc 
gatctcggct cactgcaagc tccgcctccc 
ccaagtagct gggactacag gcgcccgcca 
gtagagacgg ggtttcaccg ttttagccgg 
gcccgcctcg gcctcccaaa gtgctgggat 
atcatcttct tgactatgct gatgtgacaa 
aatatgcagt ttgggccagg caccgtggct 
agaggtgggt gaatcacttg aggccaggag 
ctctgtcttt actaaaaaaa aaaaaaaaaa 
acctgtaatc ccagctatgc tggaggctga 
gaggttgcag tgggccgaga tcacatcacc 
ctgtctcaaa taaataaata aacaaacgaa 
aaaaaaaaaa tgctgtcaac aaatagagca 
agaactctaa ggtatatttg acaaatcatt 
ggcatagaaa gacagggagg aacagggaga 
acaaggctcc taagacagac aggaggagag 
aaaaagacag agagagagag acagagacag 
gagagagggg tggagagaga cacgagatat 
gaaccacaga gagatggaag aagactctga 
agtatcgagg gtgaacagac agtggtggaa 
tccaggcgcc aagaatagtg acccagagtt 
aggcagggaa ggggctggcc tggcttccgg 
gtagggagtg acattccgga ctgggtgggg 
ggaggagcta ttgctaaggc ccgataggca 
cagtgggtgg ttataactca ggcccggtgc 
aggcacaggc ctgagaagtc tgcggctgag 
gggacagggc aagtgagacc tggtgagggt 
gtgcgtcctg cacccacatc tttctctgtc 
ctcctatctt ctgaattcta tagtgcctgg 
cttgtggttc ctctctacct ggggaaataa 
gctccccgga tcgcctgggc ctcccaaccc 
atggctacag caagaccccc ctggatgtgg 
ctgggggtca caggtaacca gaactctggg 
ctctgcggca ctagagcgcc tgtcccctgg 
gaccgggtga atgtgagtct ctgtctgtac 



gagagctccc tctgcaaagt ggcttgagtc 
cagaaagaca gggaagagga aggggtgggg 
aacacaacta cagaaacaca gagagaacac 
agtcagagag aaaagagaag atagagaaag 
agagagatta acagagtccc agatacacgc 
tgtctatgat catcttcttt tttttttttt 
tctgtcgccc aggctggagt gcagtggcgg 
gggttcacgc cattctcctg cctcagcctc 
ctacgcccgg ctaatttttt tgtattttta 
gatggcctcg atctcctgac ctcgtgatcc 
tacaggcgtg agccaccgcg cccggccatg 
gtacctaaag ccatcagact ctacccttta 
catgcctgta attccagcac tttgggaggc 
tttgagacca gcctggccaa catggtgaaa 
aaaaaaaatc agccgggtgt cgtggggcac 
ggcacgagag tcacttgaac cctggaggcg 
gccctccagc ctgggcgaca gagcaagact 
caagcagttt gttgtacctt agttatatct 
gaagtgaaat aaaggaaaat aaatgggcca 
cagaaccttt aaaaaagaaa gaatcacaga 
cagaaacacc tgtggcccaa ggagaacaaa 
agagagagag tgagtgagag acagacagag 
agagacagag aggcgagagg gatagaaaga 
tgagagagac tcagaaagat agccgaggga 
gaaaaaacca gagacaaaga tggaaagagg 
tgagcaaaat gcagagaaga aagcaagcaa 
ggtgagaagc cagatcctta aggctggggg 
agacccctcc ccattctccg ggccagggag 
ggtgctctgg gggtggagat agggggagca 
cctcattgcc cgggaatgtg ccccagggag 
ccagagccca ggaggaggca gtggccagga 
ctgggagcaa atcccccacc ccctacctgg 
ggctcagcag gcagggaagg agaggtgtct 
ccctccttgc cctgtctgga ggctgctaga 
gtctcagcgc agtgccgatg gtggcccgtc 
ggtaggggag ggaggggaag tgggttaagg 
tctgacattc cccatccagg tgcagcggcc 
gtgctctgtg ctctgatcac agccttgctt 
gtgggagggt tgtgggattg ggaggactgt 
ggaactgtgt gagcctgggc atgactccgg 
ttgtggttgt gcgatcgtat gtggccctgt 
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gactgccacg gtgtgtgtcg gggaggggga 
caggtggcac tgaccctttg aggctgtgtg 
attgtgtgtg gctccacagc tgtgtgggtg 
gtgtttggct gtgtgtggtg acttggcatt 
5 tccctgaggt cccgggattg cgtgcaacaa 
gtgtgctgct tgcaggcgat tatgtgattg 
tttgtgaccg tgtgactacc tgaagctctg 
tctgtgtgag gccgtgtaaa tgctactgta 
ttctgtctct gcctggaggg atagagggtg 

10 caggtgactg acttgcagtg tgtgcctgtg 
tgtgcacaca cggcatctgt gcgtggcact 
gctaggctgc ccgggagcgt gtgtacctgg 
gaggcaacat gggcgtgtct gcagaactgc 
gcgtggttct tggggtgagt tcgtgaatga 

15 gaaccaggcc gggcgcggtg gctcacgcct 
ggcggatcac ctgaggtcgg gagatcgagg 
ctactaaaaa tacaaaaaat tagctggtgt 
gggagactgg ggcagaaaaa tcgcttgaac 
tcgcgccatt gcactccagc ctgggcaaca 

20 gaaaaaaaaa agggtaagaa ccagtgaatg 
atgcatgtag tctgtaggtc tgtgtgtgag 
gttttcatct gagaattcag aaacctaggc 
ctgagccctt ctttcctggt cctgctttcg 
ccacctcctt tcctcaacca cgcccctagg 

25 cacccctttg ggccaggctc caccccctat 
agtcagagct tttttttttt tttttttgga 
tgcagtggcg tgatctcggc tcactgcaac 
gcctccacct cctgagtagc tgggattaca 
gtgtctttag tagagacagg gtttcacctt 

30 tcaggtgatc cgcccacctc ggcctcccag 
ccccagccca aagtcagagc tctttatagg 
ctaactaagt caattccaaa ccccttcctg 
tgaccccact tcttgagacc agttccatcc 
gctccagccc ccacagcttt ggcactaccc 

35 tttaccctca catgtagttc tagccaattc 
gtaaccctac ctgagcctgg gctctgtcct 
ctcttattct ccaggccctg cccctgcccc 
ggtctggcct cttgagtctg aaacccaccc 
caacccattt tccgttccca gagcatgttc 

40 cctctaacac cgtgccctct gggagcaacc 
cccggtcgga tgacagcagc agccgcatca 
agccgtggca ggccgcgctg ttgctaaggc 
tgcatccaca gtggctgctc acggccgccc 
gaggagggtt ggtggggacg gggaagtggg 

45 tcatggaggt gagggctggt ggggacgggg 
gggttggtgg ggatgggttg gggatgtggg 
taaggatgga gttttgcggg ggagcaaggt 
tgtggtaggg aatgggaagg agccaaggat 
tgttgaatgg tttgggatgg aggtggaatt 

50 aaatcgggct ggggtggaaa tgaagatagc 
atagaatgaa ggatggggat tggagttttg 
tgagaatgca tatggtgatg gcttctgggt 
ggtttggaat tgtgactggg atggggacag 
ggatggtttg gggaccgggg gtggggatgg 

55 gattggcgtt ggacgtggag atagagatca 
agagttttca gagtccgtct cggccactac 
cagatgttcc agggggtcaa atccatcccc 
aacgacctca tgctcatcaa actgaacaga 
atcaacgtct cctctcattg tccctctgct 



tgccttttcc catatcaggt gactgtgcgg 
tgtggttttg tgattgtgtg tgcatttaag 
aatgcatgta gcactggggg tgttcactgt 
gtatatgact gcaggtatct gcagttcctg 
aagtggtcat caccatggaa agctgtgact 
tggctgagtg tgacgttatg gatgcccgta 
tgtaggggtg actgtatgtg actgtgtgtg 
tgtgtgatgg tgcagctgtg tgtctggagt 
caggggtagc tatctctggg agatgggtgc 
tgcagaagag tatgtggcag tctgaacatc 
gagacactgt ggatgagggt gtgcgatccc 
agacagagct gtatgttagc tgcacctgtg 
gtgcgtgctt ggctgttact gctgttgtgc 
tggtggtgcc agggccatca gcaagggtaa 
gtaatcccag ccctttggga ggccgaggca 
ccagcctgac caacatggag aaccccgtct 
ggtggcgcgt gcctgtaatc ccagctactc 
ccgggaggtg gaggttgcgg tgagccgaga 
agagcgaaac tccgtctcga aagaaaaaaa 
ggcacgggag gactgatgat ggagtggggc 
aggaggagat tgacaggatt gagaaggcat 
ctgctcttcc cctccatgtg gccccctaag 
gaaccctagc tccgcccatg agctctgacc 
ccagactcta gtggaccccg cctaaggcca 
tctgtgggta ccttctagaa cccccttcaa 
gacagtcttg ctctctctcc caggctggag 
ctctgcctcc caggttcaag tgattctcgt 
ggtgcgcgcc accacgcctg gctaattttt 
gttggccagg ctggtctcaa actcccaacc 
agtgctgggg ttacaggcgt gagccaccgc 
agactctaac atgtaaccct gaccctggcc 
cctccagccc tgaccccact cactgaggcc 
ctaaagccct ggtctccctc ccatccccag 
ctgagcttgt ccaggaatcc tgtacccaat 
caggaatctg tgaggtccag ttagagtcca 
tgagcttgag cctgggcttg agaggtgcca 
ctcagcatgt cagacaccca ccctctagct 
ccagcccaag ccccgcctct gagccccgcc 
tcgccaacaa tgatgtttcc tgtgaccacc 
aggacctggg agctggggcc ggggaagacg 
tcaatggatc cgactgcgat atgcacaccc 
ccaaccagct ctactgcggg gcggtgttgg 
actgcaggaa gaagtgagtg ggagttccaa 
ggtgggggtg gggaagtggg ggtgggggtg 
aagtggggtt gggggtgtca tggaaggtga 
agcaggagga ggtcgagttg gggataggac 
gggaggatga ggttggagag gggagagtgt 
gggttggatt tggggttagg agcatatatt 
gggattggct ttagaattgg gggtgggtga 
atggagatag ggttgagatt gggagcagat 
ggtggggttg gagatggttg gatttgggct 
agggaaagaa ttagggttgg gaatgggatg 
gcatgggatt ggagaccaag agggagttga 
gggtggggct ggggctgggt gtggggttgg 
gggttggtgg tgacctgccc catcttcctc 
tccctgtcac cagtttatga atctgggcag 
caccctggct actcccaccc tggccactct 
agaattcgtc ccactaaaga tgtcagaccc 
gggacaaagt gcttggtgtc tggctggggg 
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acaaccaaga gcccccaagg tgagtgtcca 
gccttccatc tttctccact tctcattgtg 
cctccagtgc ttgaatatca gcgtgctaag 
acagatagat gacaccatgt tctgcgccgg 
5 gaggacacct ctctttattc agcagataca 
ttgccaaatt ctgagaatcc agcaattgcc 
ctcataccct agagtagtgg tgtttagtag 
agttttttag tagccacatt aaaacaggta 
taatcccagc actttgggag gctgaggcag 

10 tagcctggcc aacatggcga aactctgtct 
tggtggcggg cgcctgtaat ctcagctgct 
cccaggaggt ggaggttgca gtgagctgag 
gagtgacact tttgtctcaa aaagaaaaaa 
taactttaat aacccaatgt atcccaaata 

15 caattatgaa tgagatactt tacattcttt 
agtatatatg ttatgctgac agcacatctc 
gtagccacat gtggctagca gttactgtat 
gggctgtttt gtatggttgg gcaggttgtg 
gcactccgtg ttacagatgt cagttttggc 

20 tgtttcaaca aaatctgtaa tatgacagtt 
gaaggaaaag agaaatctgg taggtatttt 
ttgcaaagct gctggaaggg ctggaggaac 
agaatctgca taaatagggc aatttcagag 
tggttttagg atagtaaaca ataagggcca 

25 ttggagaggt ggcatttgag cagagaatgg 
taaggggaaa gaaaaggcac gtgcaaaggc 
aagaagaaga ggaaaccaat gcaactggag 
cgctggaggt gtaggcaggg gcgaatgctc 
cttccctatg ttctaatgga agctgtatct 

30 gttacatcaa ccagcaccct tctctgtatt 
gttaacaagc tctcattagc agggtgtgtg 
aggagtactc cagtcccatg gctatgaaaa 
tgcaacacct ccccagctct ccccatttct 
tgcgaggggg aaaactttta acagaagaaa 

35 cctgtaatcc caacactttg ggaggccgag 
accatcctgg ctgacacggt gaaaccctgt 
cgtggtggca ggcgcctgta gtcccagcta 
aacccgggag gcggaacttg cagtgagccg 
acacagtgag actccgtctc aaaaaaaaaa 

40 aagtggtggc atttaaaact atttagcctt 
cagacctcaa ggtgtttttt tgtttgtttt 
actaaaagct acaagcaaga aataataaca 
aaataatagc atctggctaa ttgctggaca 
attaactcat ttacctgtta ttattggccc 

,45 gcagttaact aacagcctct caaaagaaac 
gagagaaatt aaaccacaag aaagttgaaa 
ctttgaaaca gtgtctgcta ctgggaaaaa 
ccaggactct gtaattcata ttttgcatgc 
ggcacatgcc agtaatccca gcactctggg 

50 ggagttcaag accagcctgg gcaactaaaa 
attttagtag attttattca taccacttac 
cttttctttt cttttctttt cttttttgag 
gtgcaatggc accatatcag ctcactgcag 
cacctcagcc tcccaagtag ctgggataac 

55 ttttttccgt agagatgggg ttccaccatg 
ccagtgatct gcctgcctcg gcctcccaaa 
cccaggtggg agatagacat ttctctctac 
cattttcttc ataaatatta gccgagtggc 
gtggatatgg catcaggcaa aacagaccaa 



ggttcttctt gataccgacc catctctgcc 
ttcctgtttg acagtgcact tccctaaggt 
tcagaaaagg tgcgaggatg cttacccgag 
tgacaaagca ggtagagact cctgccaggt 
cactgagtgc caactcggta acatggagcg 
aagacagtca ggacccctgt tctcacagag 
aaataatgct gagctgctta tgtcatttcc 
aaaaaggctg ggcgcagtgg ctcacacctg 
gcagatcacc tttggtcagg agtttgagac 
ctaaaaaaaa atacaaaaat tagcctggca 
caggaggccg agacacaaga atcacttaaa 
atcgtgccac tcactccaac ctgggagaca 
aaaaacaagt aaaaaagaaa caggtgaagt 
caatcatttc aaagtgtaat taatataaaa 
tcttgttttc atattaagtc tttgaaagtg 
aatttggact agctacattt caggtgctca 
tggatggcac ggatctagag ggaaagatca 
cactgcataa agataccata tctaataggg 
agttttcagg cgtgtggtag ttaagtgtct 
ttctagcaag tgctggtaaa atatcttgag 
tacaagagaa tatttaatac aggggattaa 
aaagttaaaa aataaaaaac tctgtggtca 
agtggtaaag gttaacccca aaataaaaca 
atattcaaaa aggtggtcag gggagcctcc 
atgacacaaa gaagctaaac tcgtgaagtt 
cctgaggcag taaggaattt ggctgattca 
aacaaaagtg ggggcaacag tagaaagtga 
tgcaagtatt tcttggtcac caacacagag 
gttgaggaag acagaattta aaatcaaact 
caggctccca agggatctag aaggacgtaa 
tttcaacagt agttaggaag ctggggattc 
gctcccccca aattgtacaa acctgacaaa 
tctctgtgcc ctgggtgtgg gggggtgggt 
gcacatctcg gccgggcgtg gtggctcaca 
gcgggtggat cactaggtca ggagatggag 
ctctactaaa aacacaaaaa attagccggg 
ctcgggaggc tgaggcagga gaatggcctg 
aggttgcacc actgcactcc agcctgggca 
aaagaaaaga aaagaaatca catctcattc 
tctgtaggca aggttagtat cttgtttttc 
ttcataccgg tgtgtggtct gggtgtggcc 
actacaacaa tactaatacc aatagtataa 
ctgttttaag tggtttgcat gcctcagctc 
tattttacaa acaaggagcc aaggctcaga 
tctgcagaga tattaaattt aaaaaataat 
tttagaggta caggcagcta agcttgtttg 
ggcaagtctt ggctttccta ataattgata 
atgtaagtaa gaaatgaagc cgggtgcaat 
agactgaagt gggaagatca cttgagctca 
attaaaaaaa taaaaatact aattgttttt 
atcattattg tagtatgtac atatttattt 
acggagtctc gctctgtcac ccaggctgga 
catgcgcctc ctgggttcaa gcatttcttc 
aggcacccac caccatgcct ggctattttt 
ttggccaggc tggtcttgaa ctcctgacct 
ttgctggtat tacaggtgtg agccaccgtg 
ctcaaacaga ggtccactca agctactttt 
tattttgcac caggaatggt tccaggtgct 
aaacttcctg ccgcgtggac ctcatgttcc 
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ccaagtggaa gacaggcaat aaagagatag ataaatatgt agtaaattaa aaaaaaaaaa 
aattagccgg gtgtggtggc ttgcacctgt agttccagct acttgggagg ctgaggtggg 
agaattgctt gagcccaaac gtttgaggct gcggtaagcc atgactgcac tgctgcactc 
cagacagcag cctgggtgac aaagcaagac gtttttgtca gaaagaaaaa aaaaagagac 
5 gaagggagga aggagagaga aaggaaggaa ggaaggagaa agaaaggaag gaaggagaaa 
gaaaggaagg aaggaaggag aaagaaagga agaaagagaa agaaagaaaa agaaagaaag 
aaagaagaaa gaaaagagag aggaaggaag gaaagaagga aaagagggaa aaaaatgact 
gttgaagagc agtgagtatt attataggag ggtaattata gggaggtatg gggaattgaa 
gacaggaaac acaaattagt ccaagcgaat ggatttctat tgggagtgat tctgccccta 

10 gaagacactg gcaataccag gagacatttt tggttgtcac aactatatgg aggggcatta 
ctggcaacta atggatagat gccaagtgtg ctgttcaaca tgctatgatg cacacggcag 
gcctccacaa caaaccatta tccagcttca gatgcccaca gtgcccagat cgaggaaccc 
tcatccaggg gctgagaacc gtatttttgc agaagggagg tataaggatg ggttggtgga 
gaatggggaa ggaaggtgtg tgtccagtaa gagaaataag gcctgcacag gctggagggg 

15 agagtgagag agaaagggag gcggagagat acacgatgag ggagacaggc tggaacagaa 
agtagagacg aagattcgag atgtggagag gaagggtcac agaccccccc gaaatgatgt 
gtggacaaca ggaatctgga agaggaagat ggagtggaga gtgacaaatg gggtctaaag 
gttgaacttg gaggccaggc atggtggctc acgcctgtaa tcccaacact ttggaggctg 
aggtgggcga atcacttgag gccaggagtt cgagaccagc ctggccaaca tggtgaaacc 

20 ccgtctctac aaaaaaaata caaaaaatta gccgggtgtg gtgatggaca cctgtagtca 
cagctacttg ggaggctgag gcaggagaat tgcttgaacc cgggagatgg aggctgcagt 
gagctgaggt caggccactg cgctccaacc tgggcaacag agtaagactc catctcaaaa 
aaaaaaaagc tggatttgga gtgaaatatt aataacattc tccctctctc tccttttgcc 
tgtgtctcca tctctgtctt tttctgcatt tcttcatctc tgtactttcc atctctgtgt 

25 gtctgttccc atctgcttct ccatctatgg gcatctctgg gtctctcatg tctccttctg 
cccactttgc cacatctctg cctctctcat gccccccttt ctctcctgca gggtgattct 
ggggggcctg tggtctgcaa tggctccctg cagggactcg tgtcctgggg agattaccct 
tgtgcccggc ccaacagacc gggtgtctac acgaacctct gcaagttcac caagtggatc 
caggaaacca tccaggccaa ctcctgagtc atcccaggac tcagcacacc ggcatcccca 

30 cctgctgcag ggacagccct gacactcctt tcagaccctc attccttccc agagatgttg 
agaatgttca tctctccagc ccctgacccc atgtctcctg gactcagggt ctgcttcccc 
cacattgggc tgaccgtgtc tctctagttg aaccctggga acaatttcca aaactgtcca 
gggcgggggt tgcgtctcaa tctccctggg gcactttcat cctcaagctc agggcccatc 
ccttctctgc agctctgacc caaatttagt cccagaaata aactgagaag 



SEQ ID NO. 6 

hk6 amino acid 

40 

MKKLMVVLSLIAAAWAEEQNKLVHGGPCDKTSHPYQAALYTSGHLLCGGVLIHPLWVLTA7VHCKKPNLQVFL 
GKHNLRQRESSQEQSSVVRAVIHPDYDAASHDQDIMLLRLARPAKLSELIQPLPLERDCSANTTSCHILGWG 
KTADGDFPDTIQCAYIHLVSREECEHAYPGQITQNMLCAGDEKYGKDSCQGDSGGPLVCGDHLRGLVSWGNI 
PC GSKEKPGVYTNVCRYTNWIQKTIQAK 

45 

SEQ ID NO. 7 
KLK6 nucleic acid 
CDS 147.. 881 

gtcgacccac gcgtccggct ggctggctcg ctctctcctg gggacacaga ggtcggcagg 
50 cagcacacag agggacctac gggcagctgt tccttccccc gactcaagaa tccccggagg 
cccggaggcc tgcagcagga gcggccatga agaagctgat ggtggtgctg agtctgattg 
ctgcagcctg ggcagaggag cagaataagt tggtgcatgg cggaccctgc gacaagacat 
ctcaccccta ccaagctgcc ctctacacct cgggccactt gctctgtggt ggggtcctta 
tccatccact gtgggtcctc acagctgccc actgcaaaaa accgaatctt caggtcttcc 
55 tggggaagca taaccttcgg caaagggaga gttcccagga gcagagttct gttgtccggg 
ctgtgatcca ccctgactat gatgccgcca gccatgacca ggacatcatg ctgttgcgcc 
tggcacgccc agccaaactc tctgaactca tccagcccct tcccctggag agggactgct 
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cagccaacac caccagctgc cacatcctgg gctggggcaa gacagcagat ggtgatttcc 
ctgacaccat ccagtgtgca tacatccacc tggtgtcccg tgaggagtgt gagcatgcct 
accctggcca gatcacccag aacatgttgt gtgctgggga tgagaagtac gggaaggatt 
cctgccaggg tgattctggg ggtccgctgg tatgtggaga ccacctccga ggccttgtgt 
5 catggggtaa catcccctgt ggatcaaagg agaagccagg agtctacacc aacgtctgca 
gatacacgaa ctggatccaa aaaaccattc aggccaagtg accctgacat gtgacatcta 
cctcccgacc taccacccca ctggctggtt ccagaacgtc tctcacctag accttgcctc 
ccctcctctc ctgcccagct ctgaccctga tgcttaataa acgcagcgac gtgagggtcc 
tgattctccc tggttttacc ccagctccat ccttgcatca ctggggagga cgtgatgagt 

10 gaggacttgg gtcctcggtc ttacccccac cactaagaga atacaggaaa atcccttcta 
ggcatctcct ctccccaacc cttccacacg tttgatttct tcctgcagag gcccagccac 
gtgtctggaa tcccagctcc gctgcttact gtcggtgtcc ccttgggatg tacctttctt 
cactgcagat ttctcacctg taagatgaag ataaggatga tacagtctcc ataaggcagt 
ggctgttgga aagatttaag gtttcacacc tatgacatac atggaatagc acctgggcca 

15 ccatgcactc aataaagaat gaattttatt atgaaaaaaa aaaaaaaaaa aaaaaaaaaa 
agggcggccg c 

SEQ ID NO. 8 

KLK6 nucleic acid 

20 mRNA join (2001 . . 2185, 3084 . . 3135, 3559 . . 3606 r 4346 . . 4502 , 
8122. .8369,9791. .9927,11805. .12483) 

CDS join (3567.. 3606, 4346. .4502,8122. .8369,9791. .9927, 11805 .. 11957 ) 

25 acacttaaaa aatcttctga cttaaaaaaa aaagtatggt gattggaaaa tgtaaatgtg 
catgcgtgct tggcatcaca tttcattggc caggacttcc ctggatgcta aaggtcctca 
aatgccaggc tggggggctg ggacttggtc ccaagggaga tggggaccca gggcacgtct 
gtgagaggag gggcaaggtc agcacaaggc acaggaaggt ctctctgggg caagggatac 
agagaacaga gggatcctgg tccaggtggg agaggtgcag ctctgagttg gggttgaggg 

30 tgtgggtaca gagaggaagg gaccccccag agagaggagg cagagggata gggcctggtc 
actgggttgt gcaacatcag acttgctgtc tgtgaagata gcacgtcctg agaagaaggt 
gctgaggtca gtggggacca aatgtgagag ggagcacccg gagagtatac tgaataccga 
agtagtcttc atccctggag tgatgggggg tgcacaatgc aagatgacaa ttagattcaa 
tgcaagacaa agaaaagggt tggctgggaa cagtggctca tgcctatggt cccagctcct 

35 gggaagactg aggcgggagg gtcgcttgag cccaggaggg ttgaggctgc cacgagcaag 
gatcgtgcca ctgcactcca gcctaggcga cagaacaaga ccttgtctca aaagaaaaaa 
gaactttttt ttttaagtta cctgtagtgc ccagcccaag caggtgctga gccagacttc 
attcctatca ttgtccttat tacgcagtga cttccccctc ctcatttctc tccactctgc 
cacgcacaca ccctcaccct ccagcccata ccaaccaccc caaccactgc ctgtggtttc 

40 ccatgtgcac ccaggccagg cattttcacg gcctttcctc ctgacctacg cctggctcag 
ctttctaggc ccaagttcaa agacacctcc ctaaatcttc ccagatccct ctgctactgc 
ccagcaccac catcttatca cagccccacg tcgttcccaa gtgctctccg atttctgctt 
aactccatgc ctctcgctgt gtgtccgcat ctcatcaata agtcctcaag tcctcttcca 
tcctgctagc ttcctcatcg ctcgggaatc atccccgcta cttcctgggg aaactgactc 

45 ccttctgggc acacacagtg ctacccccgg ggaaatctaa gaagagaccc aggagaagat 
aagcacggag agtcagagaa tcaaggggaa agaaagggag agaggccggg cacagtggct 
cacacctgta atccagcact ttgggaggcc aaggtgggtg gatcacctga ggtcaggagt 
ttgagaccag cctggccaac atggtgaaac ctcttcccta ctaaaaatac aaaaaacatt 
tagccgggcg tggtggtggg tgcctgtaat cccagctact tgggaagctg aggcaggaga 

50 actgcttgaa ctcaggaggc ggaggttgca gtgaactgag atcacaccac tgcactccag 
cctgagtgac agagcaagac tccgtcaaaa aaaaagaaag aaagaaagaa aagaaggaag 
gaaagaaaga aggaaggaag gaagggagga agggagagag gaagggagag aggaagggag 
agagagaaaa aaagagggag agagacacaa atacagagac tgagatggga gagagagaga 
gatggaagct ccctcccctc catggccagg gagacagatg gagcaagaga cctcaggggt 

55 gggcagactt ggaggagaag gaccaggagg atgtggagtg ccgaaatctc cagtcagggc 
caggtgggca gtcagagact gcaaaggagg actgtcagac agggacaaaa ggaagccatt 
gatgtaaccg ccctcccgcc tgcccgccgg aagagaggtt gaggccggag ctgctgggag 
catggcactg gggtgctggg aggcggacaa agcccgattg ttcctgggcc ctttccccat 
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cgcgcgtggg cctgctcccc agcccggggc 
tgtagctgtc tccccggctg gctggctcgc 
agcacacaga gggacctacg ggcaggtgtg 
aggctgcttc ccagtgccgg agggctctag 
5 gtggcacaga gagtgctggg ggtgcaggga 
gccggaattt gtcttcagac actttctttc 
gactagaact tcctctgcct cagattcagg 
tttaggtcct agcccctcct ccctcagacc 
gactcaggag tccaggcccc cagcccctcc 

10 ctcctccctc agacccagga gtccaggccc 
caggccccca gcccctcctc cctcagactc 
' tcagacccag gagtccaggc cctcactgca 
cctggtcagg ggtcaccaag agcagagcgt 
gggtaaggag gaaaagggtg tagccagtct 

15 aaaaggacgt tccagaagca tctggggaca 
ctgggggtgt gtgtctggca gtccctgcag 
cttggctctg ccactgcatc tgagtgtctt 
ctttctgcct cctcgtctca aagctgttcc 
ggaggcctgc agcaggtgag atcacagaca 

20 tggccattgc gcacagagcc aggctccgag 
ggccccctat ggtaac'cctc tcctgtcgac 
aaagatctaa tcagaatctc accatgggtt 
ttggcactgt tttcgtggat cctcttggaa 
gctagatggg aaggacagag gtcaaggcag 

25 ggatgcagag ggagcagaca gagggatggg 
tgtggcttcc cctctcagga gcggccatga 
ctgcaggtgg ggaaagggca tttggatggg 
atggagaaga ggctggtatt ggggatgggg 
gaaatgagga agacgttggg gattaggcta 

30 gggaggtggg tttgaagata tgagggtttg 
aaacatagaa gaggtaggag gtaggttgga 
cagttgggtt tgtaatggga atggggtaag 
ttttttgaga cagggtctca ctctgtcacc 
ttcactgcag acttgaactc ttgggtctca 

35 gctgggacta caggcgtatg ccaccatacc 
gtgtgtgtgt gtgtgtgtgg agatgaggtc 
cctgggctca agcgatcctc ctgcctcagc 
accaatcttg actggagttc atgttgaggg 
ctgactcaga tcttctctcc ctcagcctgg 

40 ggaccctgcg acaagacatc tcacccctac 
ctctgtggtg gggtccttat ccatccactg 
ccgtgagtct acactgtaaa tgaacagcag 
atgtcaggca ggaggtgaca taggcatccc 
caggtgcatt cggctgttgc ttaattgagt 

45 gcagtggaaa agaaaataaa aaaaagaaaa 
ctgaacttac tttctaatgg gggaattgga 
atttggattt aattctgagc acagtaggaa 
tttgtttttt gagacacagt ctcgctctgt 
cagctcactg caacctctgc ctcccaggtt 

50 tagctgagat tacaggtgtg caccaccttg 
cggggtttca ccatgttggc caggctggtc 
cctcgccctc ccaaagagct gggattacag 
tacattttta caagcaccct ggctaccacg 
ggaggcccac gtgggggctg ttgctttcat 

55 ggcggtcgca gtggggatgg agggatgttg 
agccagcaga atctggcaac gaggaacagg 
tgtatttgtc ctgaacaact gggtgttttg 
aaagagaaac aggccgggtg taggcagggg 
agttggagat gccggggaga tgtcccagca 



aggggcgggg gccagtgtgg tgacacacgc 
tctctcctgg ggacacagag gtcggcaggc 
tgagtcaccc caaccgcact gaacctgggc 
agcccggagt gagggcctgc aggtccctgg 
ggcctggggc accatctgct tgccccagag 
tccaaaaccc ggaggtctaa ggactgagcc 
ccccagcccc tcctccctca gacccaggag 
caggagtcca agttcccacc tcctccctca 
tccctcagac ccaggagtcc aagttctcac 
caagcccctc ctccctcaga cgcaagggtc 
aggagtccag gcccccaagc ccctcctccc 
ctcagggacc agtgctccct tccctggagg 
gggggcggga ggaatgtgtg tgggaggcct 
cctggctcag ggacctgaga gacaggggtt 
gaaccagcct cttccaggga ggcctgggag 
ccctgggctc tgcggcccct gcgtcctccg 
ctctcctcac ggctccccgc atttctaact 
ttcccccgac tcaagaatcc ccggaggccc 
tcacagaacc tgccgggtgg gcggggtggg 
gaaaactccc atacagagga agaacgctag 
aggaaggcaa atcagtgccc aagaaagtag 
actggaccag tggacgtagt "tgaattctct 
gatgtgggct gaggaagaat aaatcaggag 
gagaccatag cagggcagga aggaaggaga 
gggagggtcg aggcagtgac taatggacca 
agaagctgat ggtggtgctg agtctgattg 
ggaggcttgc agacagggtt gggcttgttg 
atatgcacag ggttggggtg ggggagcttt 
agggtgggga atacagatag ggagggtggt 
gggtggggtt ggctttaggg atggggatct 
aagttggaga gagcccggga ataggggata 
tttgggagtg gaaatacaga gaagcttttt 
caggctggag tgtagtggca tgatccatag 
agtgaccctc ccacctcagc ctcccaagta 
ctgctaattt gtgtgtgtgt gtgtgtgtgt 
tcactgtgtt accgaggctg gtctcaaact 
tgggattaca ggcataagcc actgcacctg 
ggatgcgctt ggtttctcca gaactcctct 
gcagaggagc agaataagtt ggtgcatggc 
caagctgccc tctacacctc gggccacttg 
tgggtcctca cagctgccca ctgcaaaaaa 
atgcgactga accctgaggg tgtcttatag 
ccccatccca gcacgaggcc atctgatagc 
acttaatgtg tgccaggccc tgcgggcata 
caaaaaaaaa caagcaaaat tgctgttttc 
tcatttgggg acctgcaggg cgtgatgggc 
gccactgggc agttttgttt ttgttgtttg 
cacccaggct ggagtgtagt ggcatgatct 
ccagcgattc tcctgcctca gcaccccaag 
cctggctaat ttttgtatgt ttggtagaga 
tcgaactcct gacctcaggt gatccgcccg 
gcatgagcca ccaccacacc cagcctgatt 
tggaacgtgg tctgggcaag agagagggag 
ccggcgacat aggagggtgg cttgaaccca 
aatatcttgg gatgtggaat tctgagactg 
agggagagga agaagcacgg ctggcttccg 
ccacgtcttt ctctgagttg tgggagaggg 
agcatctgac attttgcttt agccacgatg 
gggaggecag ggaggactct ggagctcaga 
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ggagaggtca gggctggagg ttaaaatgaa 
ccatgggact agatgagatc atccaaaaag 
ggacaaaaac cctgggcgct gatcctcact 
gatgctaact accaatcagg tgctgagtga 
5 accacaaggg actcttggca ccattttgca 
aagtgacaat ccctggggtg gttcccctga 
ccatcattca ttcctttgat gtacattgac 
ttgggcagtg gagattcagc aatggatggg 
caaggacaga aaggtgcaga caagcaaagt 

10 atcccagcac tttgggaggc cgaggtgggt 
aacatggctc aaccctgtct ctactgaaaa 
cttctgtaat tccagcaact tgggaggcta 
ggaggttgca gtgagccgag atcgcgccac 
tctgtctaaa aaaaaaaaaa ggaaagaaag 

15 catgcctgta atcccagcac tttgggaggc 
tacaaagctg cagtgagctg tgatctacag 
gagcgagacc ctgtctcaaa aaaacaaaca 
tataattaca aattatgaag gaaaagaata 
ggaaggactt tctaatgaga taaaatccaa 

20 tcagggcaga ggaaaggctg tgataacacc 
agaaaataaa atttcccgtt cactgggggg 
tgactacagc cagatcacac aggggctcca 
ttaggacaat ggggagccat gggtgatgtc 
atatgtatca aacacctatc ctgtgccagg 

25 tgaatagaac aaaaatcccc atcttgacat 
atcagccaca tatagcaaat tacattgaaa 
ttcaagtact cagcagccac ctgtagcttg 
gatcgagatc atggcatcgt agcatttagt 
aacacaatga gtaaatattt aacaataaat 

30 ctctggtgga acagaaagca ggggagggag 
ttttaattgg gcaactaagg aaggcttccc 
gtgagctatg cagatacttg gaggacagac 
ccctgaggtg ggaagatcac tattgtgttc 
ggagcagagg gagagaaggg gagagtggga 

35 tgcttgcaag gcctggtgtg ccacgttgag 
agtcatagga ggggctgagc agaggaggca 
tggttgct'tt gtggaggatg gactgtgggg 
gaggctactg ctctagttca ggtaggaagt 
gtgggaaagg tgagatgtgg ccagattctg 

40 ggacagcttg gatgtagggc atgaaataaa 
tgaaaggatg gaattgccat ttacccagct 
attcatgact tcccagccct ctctgaagcc 
cccagccctc ttccttccca ggaatcttca 
aagggagagt tcccaggagc agagttctgt 

45 tgccgccagc catgaccagg acatcatgct 
tgaactcatc cagccccttc ccctggagag 
catcctgggc tggggcaaga cagcagatgg 
tactggctac ttggggaagt gtgccaaagg 
tgggaagatg ggctaatggt gaggaccaat 

50 gggggaggga gagtgaattt gggagctggg 
gaccaatggg tgaatagcat gggagagatg 
aaggtcagtg gggagatgct aatcaggaag 
attcattgaa cagcaggaag gaataatgga 
aaagcacaaa agccaactga aggatgtgaa 

55 gaagagggac taaggggaaa ggatcaatgg 
tccaatagat cagcaggatc catgaaggtg 
tgaaccattg gatgaagggc cagtgggaag 
tagaaaagga ccaatgaggg aggtggacca 
cagtggggga tggtgaggcc agttagaaaa 



ggcatcgtca gcaaacaggt gtatttaaag 
ctggcatagt tggaggagct ggagggccca 
agtcagattc acgacagctg ccacttgttt 
aaccatgtac acacctttcc tggaatgccc 
aatgaggaaa ctgaggtgca gggaaatagc 
ccccaaggag accttggatg actctcacca 
taagagcacc tgctaagtgc cacattcgag 
acacacacgt catccctgcc ctcgggagca 
gagggctggg catggtggct cacgcctgta 
ggattacctg agttcgagac cagcttggcc 
tacaaaaaat tagccaggcg tggtggtggg 
aggcaggaga attgcttgaa cgtgggaggc 
tgcactccag cctgaaccac agagcgagac 
aagcagcaaa ttgggctggc cgtggtggct 
cgaggcgggt ggatcactcg agcccaggag 
aacaccactg cagatccagc ctgggtgaca 
aacaaaagaa gcaaaccctt caaaacccca 
cgggtaccta ctttagatgg aggagggtca 
gcggaggcat gaagatggga aaaggaatgt 
cctgaggtga gaaccgtctt gagtattctc 
cagaaggtgc tgggagataa ggttggaaag 
gtgccaagtg gaggagccca ggctttattc 
tgagcaaggg agtgactctc tgtttcagga 
tgctgatcaa cgcactggag atactatatc 
cctagagctg cactgtctaa tatggtagcc 
ttaatgaaat- ggaaaatcca caagccacat 
tggttccccc agccacctct ggacagtgca 
ggacagcatt gctctgcaag gaggagaaat 
atatagoagg tcggatgatt gtgataggtt 
ataggaattg cctactaaca ggtatttgta 
tgagaggcga catttaaagg aagtgaggga 
ttgctggcag agggaacagc agtgcaaagg 
aaggcaagac agggaagcca gcgtttggct 
ggagaagatg tctgtgagat gatggggcag 
aactttggct ttgattctga gtgagatggg 
caggaccaac ttacattgtt aaaatatctc 
gaccagagac agagcaggga gcccagtgag 
gaaaaggcag ctcaaaccaa gatggtagcc 
gatatgcttc agagaggcaa aaggaattct 
gagagtgaag aatagccccc aagattattc 
ggggaagact gtgggaggag caggccagcg 
tcaactgcag cccaagggct ccaggtgaga 
ggtcttcctg gggaagcata accttcggca 
tgtccgggct gtgatccacc ctgactatga 
gttgcgcctg gcacgcccag ccaaactctc 
ggactgctca gccaacacca ccagctgcca 
tcagtagtgg gaggctggtg gggagcaggc 
atggggagtg ggaaaattgg tgaggggcca 
gggacagggt ttcaatggga gaaaggtcaa 
ccagtgagtg aacagccaat ggaaaatgta 
gaacataaga tgaaggttca ataaagaggg 
gatgtcaaag gtcaaagggg actgatcagg 
gaaggaactg atggaagaag agaaaccaat 
ttgagacagt gaatgggggt atagctgatg 
tccagaggag tcactagagg aaaaaacagg 
ggcctgtgtg tgaagggcca ataagaaagg 
gcagagacaa tgggggagga tgcggcaagt 
ttggatgaag ggctaatagg aagggagagc 
ggaccaagga gggaagcaga ccaataggaa 



33/47 



gagagagcca atgagggagg 
attggaggaa gggccaatag 
aggaccaatg atggaggtgg 
gagagggcat ggccagttag 
caatgggcag gaagtgtcca 
agggcgtaac agaggaagag 
gtggaagaga gaaaagtgga 
gactcctgga gaagagacta 
aagagggacc aattaggagg 
aggaagaggg gtgccaatag 
ggggggtgac tggggaggat 
cccctaacag gtgatttccc 
gaggagtgtg agcatgccta 
gagaagtacg ggaaggattc 
agggacagga cgaagtcaca 
aagagagctt tacagagaca 
agagacttag ttcaacacac 
ccagcagaga caggaagtgc 
gccaggagca gcggctcatg 
cacctagggt caggagttcg 
aaaatacaaa aattaggatg 
gccgaagcag gaggatcacc 
aaccctatct ctactaaaaa 
cccagcacct tgggaggccg 
ctggccgata tggtgaaacc 
caggcgcctg tagtcccagc 
ggcggaggtt gttgcagtga 
aagattccgt ctcaaaaaaa 
ctgtagtccc agctactcgg 
ggctgcagtg agctgagatc 
tctaaaaaca aaaagaacca 
ccttcctcaa acagagcccc 
agacactagc tggggaaagg 
gtcatccatc caggctctcc 
agaccaacca agggggagac 
acagggaaag cgatacatag 
ctcaagacac gaggtggaga 
gggccctccc tacctctccc 
ctcctcttcc tcctcctcct 
gtctctacac ctctgcctct 
tcttgctctc tatgttcctc 
ttattctctc tctaccattc 
tctctctgtc cctgagtctt 
tctctctctg tcacacacac 
tctgggtttc tatctgtatc 
cgctggtatg tggagaccac 
caaaggagaa gccaggagtc 
ccattcaggc caagtgaccc 
ctggttccag aacgtctctc 
ccctgatgct taataaacgc 
ctccatcctt gcatcactgg 
ccccaccact aagagaatac 
cacacgtttg atttcttcct 
cttactgtcg gtgtcccctt 
atgaagataa ggatgataca 
cacacctatg acatacatgg 
ttt 



gcagggccag ttaggaaagg 
aaagggagga tccatgaggg 
accattggat gaagaaccaa 
gaaaagacca atggtcacag 
atgaagaatg gactactgat 
tcctccaggt caactgaaac 
ggagggacct aagagaaaag 
ttaatgagga agacagccaa 
cagggacgat ggtaatgaga 
aaaagaggga ccaatagagg 
gaggggagtg caaggcctgg 
tgacaccatc cagtgtgcat 
ccctggccag atcacccaga 
ctgccaggtg aggtgacccg 
aaaacatggc cagacacagg 
gatagagaca ggctgaggga 
agagacacag tcagggatat 
agagacaagg atggaggccg 
cctgtaatcc cggcactttg 
agaccagcct gatcaacatg 
ggcacagtgg ctcatgcctg 
tggggtcagg agttcgagac 
tacaaaaatt aggatgggca 
aagcaggagg atcacctggg 
ctatctctac taaaaataca 
tactcaggag gctgtggcag 
gtcgagatca tgctactgca 
aaccaaaaaa caaaaattac 
gaggctgagg ctggagaatt 
acgccactgc actccagcct 
aagagaagta gtaaggaagc 
cacgagtcct gctcagaaac 
ggactccctc ccgaatactt 
tctttatgcc agaatgacta 
acaggcagaa acggagacac 
caagttggac gcaaagaaag 
ggtgtccctg gacagaatag 
acctgggtct tatcgtctcc 
cctccctcat catcttcttc 
ctccacacct ctcagtctcc 
tgcatcttgg cattcctatc 
tctctctgtg cctttgtgtg 
tctctccatc tttcagtaag 
acacacacac acacacacac 
tgactttctc cctctttcct 
ctccgaggcc ttgtgtcatg 
tacaccaacg tctgcagata 
tgacatgtga catctacctc 
acctagacct tgcctcccct 
agcgacgtga gggtcctgat 
ggaggacgtg atgagtgagg 
aggaaaatcc cttctaggca 
gcagaggccc agccacgtgt 
gggatgtacc tttcttcact 
gtctccataa ggcagtggct 
aatagcacct gggccaccat 



accaatgagg aaggtagacc 
agggtgggga cagttagaaa 
tagaaaggaa gaaccaatgg 
agtgaccaat caagatgaat 
caggaggggt acagtagagg 
tactgaagaa ggtgggacca 
gaaaaccaat aggaaatgag 
tgggggggaa gaatgataga 
tgtaagaatg agagacaaac 
atggaggact tataggggtt 
gctgagtctg gcccatctct 
acatccacct ggtgtcccgt 
acatgttgtg tgctggggat 
gatctgccac ttacacagcc 
aagagagaga cacaggccaa 
gaacccaagc cttgaaaaga 
gcagagatat aaagacacag 
cgggatcaag aaccagagag 
ggaggccgaa gcaggaggat 
gtgaaaccct atctctacta 
taatcccagc accttgggag 
cagcctgatc aacatggtga 
cagtggctca tgcctgtaat 
gtcaagagat tgagaccagc 
aaaattagct gggcctggtg 
gagaatcact tgaacctgga 
ctccagcctg gcaacagagc 
gcaagcatgg tgggacacac 
gcttaaaccc aggaggcaga 
ggggacagag ccagactctg 
agatggtgtg aggggactgt 
gaccaggctc tggaggaggg 
taacttgggt ttcctccatt 
atgcactgag ggatgtgcag 
aggcagaaac agggacagag 
ggcaggtggg cgagactgtc 
tgccaggcat atctctccct 
tcctccccct cctccctctc 
ttttctctct ctctccatcg 
attcttaaat tgtttctctt 
tctgtgtctt tgagtctcct 
tcttactgtc tctctctctg 
tacctctgtc cctttctacc 
acacacacac acacacagtc 
gcagggtgat tctgggggtc 
gggtaacatc ccctgtggat 
cacgaactgg atccaaaaaa 
ccgacctacc accccactgg 
cctctcctgc ccagctctga 
tctccctggt tttaccccag 
acttgggtcc tcggtcttac 
tctcctctcc ccaacccttc 
ctggaatccc agctccgctg 
gcagatttct cacctgtaag 
gttggaaaga tttaaggttt 
gcactcaata aagaatgaat 
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SEQ ID NO. 9 

KLK6 nucleic acid 

CDS 246. .980 



aggcggacaa 
agcccggggc 
gctggctcgc 

10 ggcagctgtt 
cggccatgaa 
agaataagtt 
tctacacctc 
cagctgccca 

15 aaagggagag 
atgccgccag 
ctgaactcat 
acatcctggg 
acatccacct 

20 acatgttgtg 
gtccgctggt 
gatcaaagga 
aaaccattca 
tggctggttc 

25 tgaccctgat 
cagctccatc 
tacccccacc 
ttccacacgt 
ctgcttactg 

30 aagatgaaga 
tttcacacct 
tatttt 



agcccgattg 
aggggcgggg 
tctctcctgg 
ccttcccccg 
gaagctgatg 
ggtgcatggc 
gggccacttg 
ctgcaaaaaa 
ttcccaggag 
ccatgaccag 
ccagcccctt 
ctggggcaag 
ggtgtcccgt 
tgctggggat 
atgtggagac 
gaagccagga 
ggccaagtga 
cagaacgtct 
gcttaataaa 
cttgcatcac 
actaagagaa 
ttgatttctt 
tcggtgtccc 
taaggatgat 
atgacataca 



ttcctgggcc 
gccagtgtgg 
ggacacagag 
actcaagaat 
gtggtgctga 
ggaccctgcg 
ctctgtggtg 
ccgaatcttc 
cagagttctg 
gacatcatgc 
cccctggaga 
acagcagatg 
gaggagtgtg 
gagaagtacg 
cacctccgag 
gtctacacca 
ccctgacatg 
ctcacctaga 
cgcagcgacg 
tggggaggac 
tacaggaaaa 
cctgcagagg 
cttgggatgt 
acagtctcca 
tgggatagca 



ctttccccat 
tgacacacgc 
gtcggcaggc 
ccccggaggc 
gtctgattgc 
acaagacatc 
gggtccttat 
aggtcttcct 
ttgtccgggc 
tgttgcgcct 
gggactgctc 
gtgatttccc 
agcatgccta 
ggaaggattc 
gccttgtgtc 
acgtctgcag 
tgacatctac 
ccttgcctcc 
tgagggtcct 
gtgatgagtg 
tcccttctag 
cccagccacg 
acctttcttc 
tcaggcagtg 
cctgggccgc 



cgcgcctggg 
tgtagctgtc 
agcacacaga 
ccggaggcct 
tgcagcctgg 
tcacccctac 
ccatccactg 
ggggaagcat 
tgtgatccac 
ggcacgccca 
agccaacacc 
tgacaccatc 
ccctggccag 
ctgccagggt 
atggggtaac 
atacacgaac 
ctcccgacct 
cctcctctcc 
gattctccct 
aggacttggg 
gcatctcctc 
tgtctggaat 
actgcagatt 
gctgttggaa 
catgcactca 



cctgctcccc 
tccccggctg 
gggacctacg 
gcagcaggag 
gcagaggagc 
caagctgccc 
tgggtcctca 
aaccttcggc 
cctgactatg 
gccaaactct 
accagctgcc 
cagtgtgcat 
atcacccaga 
gattctgggg 
atcccctgtg 
tggatccaaa 
accaccccac 
tgcccagctc 
ggttttaccc 
tcctcggtct 
tccccaaccc 
cccagctccg 
tctcacctgt 
agatttaaga 
ataaagaatg 



35 

SEQ ID NO. 10 

hk7 amino acid 

MARSLLLPLQILLLSLALETAGEEAQGDKIIDGAPCARGSHPWQ 
40 VALLSGNQLHCGGVLVNERWVLTAAHCKMNEYTVHLGSDTLGDRRAQRIKASKSFRHP 
GYSTQTHVNDLMLVKLNSQARLSSMVKKVRLPSRCEPPGTTCTVSGWGTTTSPDVTFP 
SDLMCVDVKLISPQDCTKVYKDLLENSMLCAGIPDSKKNACNGDSGGPLVCRGTLQGL 
VSWGTFPCGQPNDPGVYTQVCKFTKWINDTMKKHR 



45 



SEQ ID NO. 11 
KLK7 nucleic acid 
CDS 16. .777 



50 ggatttccgg 
ttagccttgg 
tgtgcaagag 
ggaggcgtcc 
tacaccgtgc 

55 tcgaagtcat 



gctccatggc 
aaactgcagg 
gctcccaccc 
tggtcaatga 
acctgggcag 
tccgccaccc 



aagatccctt 
agaagaagcc 
atggcaggtg 
gcgctgggtg 
tgatacgctg 
cggctactcc 



ctcctgcccc 
cagggtgaca 
gccctgctca 
ctcactgccg 
ggcgacagga 
acacagaccc 



tgcagatcct 
agattattga 
gtggcaatca 
cccactgcaa 
gagctcagag 
atgttaatga 



actgctatcc 
tggcgcccca 
gctccactgc 
gatgaatgag 
gatcaaggcc 
cctcatgctc 
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gtgaagctca atagccaggc caggctgtca tccatggtga agaaagtcag gctgccctcc 

cgctgcgaac cccctggaac cacctgtact gtctccggct ggggcactac cacgagccca 

gatgtgacct ttccctctga cctcatgtgc gtggatgtca agctcatctc cccccaggac 

tgcacgaagg tttacaagga cttactggaa aattccatgc tgtgcgctgg catccccgac 

5 tccaagaaaa acgcctgcaa tggtgactca gggggaccgt tggtgtgcag aggtaccctg 

caaggtctgg tgtcctgggg aactttccct tgcggccaac ccaatgaccc aggagtctac 

actcaagtgt gcaagttcac caagtggata aatgacacca tgaaaaagca tcgctaacgc 

cacactgagt taattaactg tgtgcttcca acagaaaatg cacaggagtg aggacgccga 

tgacctatga agtcaaattt gactttacct ttcctcaaag atatatttaa acctcatgcc 

10 ctgttgataa accaatcaaa ttggtaaaga cctaaaacca aaacaaataa agaaacacaa 
aaccctcaa 

SEQ ID NO. 12 
KLK7 nucleic acid 

15 

mRNA 

join (1756. .1785,3179. .3309, 3722. .3869,4566. .4813,5129. .5265,7362. .8265) 
/product=" stratum corneum chyraotryptic enzyme" /note="alternatively 
spliced" 

mRNA join (1756. .1785,3179. .3309,3722. . 3869, 4566 .. 4813, 
5129. .5265,7362. .7991) /note="alternatively spliced" 

mRNA 

25 join (1821. .1864,3179. .3309,3722. .3869,4566. .4813,5129. .5265,7362. .8265) 
/product="stratum corneum chymotryptic enzyme" /note=" alternatively 
spliced" 

mRNA 

30 join (1821. .1864,3179. .3309,3722. .3869,4566. .4813,5129. .5265,7362. .7991) 
/note="alternatively spliced" 

CDS join (3237. .3309,3722. .3869,4566. .4813,5129. .5265, 7362. .7517) 



20 



35 

ggcatggtgg tgcacgcctg taatccagct actcaggact ctgaggcagg agaatcactt 
gaacacgggg gagtggaggt tgcagtgagc cgagatcgtg ccattgcact ccagcctggg 
tgacagagcc agagtccatc aaaaaaaaaa aaaaataaga aagattcttc tctcctctat 
gtgtccatgc agtctcatca tttagctacc acttgtaagt aggaacatgc catatctggt 

40 tttctgttcc tgctttagtt tgtaagggta atggcctcca gctccattca cgtccctaca 
aaggacatga tcgtgttctt ttttatggct acgtagtatt caattgtgta tacgtaccac 
attttcttaa tccagtctat cactgatgga catttaggtt gattccctgt gtttgctgtt 
gtcaatagtt ctacaatgaa cgtacgtgtc catgtgtctt taaacagaat gatttatatt 
cctttgggta cacacactgg ggcttatgag agggtggaga gtgggaggaa ggagaggatc 

45 agaaaaaaat aactaatggg tactaggctt aatacctggg tgattaaata atctgtataa 
caaaccccca tggcgcacgt tcacctacgc aacaaacctg cacatcctgc acatgtaccc 
ccgaactgaa aagttaaaaa aagaaaaata aatatttgct tataaattaa taaatgaagc 
cctcaaaaat gttctattag ataatgttaa gtacagacat ttttgttata aatacataat 
atacaaagaa atctatgtat aacatgatta aaatgaccat aagaacatag atcctaaaca 

50 tggcaaatat tagtggggtg gggttaggga aagcgttgtt tttaacttac acctctctgt 
tagagttggg aatgggttca ggcgtaatta caggcacgac tgggatcagc ttggacaagt 
tcccccaggc gggccagaat taggatgtag ggtctaggcc acccctgaga gggggtgagg 
gcaagaaaat ggccccagaa gccgggcgca gtggctcacg cctgtaatcc cagcactttg 
cggggccgag gcgggcacat catgaggtca ggagatcgag accattctgg ccaacatagt 

55 gaaacccggt ctctactaaa aatacaaaaa ttagctggga gtggtggtgc gtgcctgtaa 
tcccaggtac tcgggaggct gaggcaggag aatcacttga acctgggagg cggagctggc 
agtgagccga gatcgcgcca ccgcactcca gcctggcgat agagagagac tccatccaaa 
aaaaagaaag gaagggaggg agggaggagg gaagaaagaa agaaaaccgc cccagagaag 
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gacccgagcc agagcctatt ctctgagctc 
cagacccagc gcaccgggtc cctcccccga 
■ tgtgcggggg cgcaggagct gggcaccgcc 
gcagtggtgc caattgctct cgctgcgtca 
5 ggcgggggtg tcccagagtc ggctccgcct 
gcggggggtg atataagagg acggcccagc 
gtgccagccc aagtcggaac ttggatcaca 
atcaagggtg cacctgtagg ggagagggtg 
agaggtaggg gctgcagcct agactcccgg 

10 atttggggcc ttttagggaa aaacgaacta 
gggcaagatc ggagcctaga ttcctgggtc 
tctgagggag gaggggctgg ggttctggac 
cctggactcc tgggtctgag ggaggagggg 
ggaggggctg ggggcctgga ctcctgggtc 

15 taggtctgag ggaggaggag ctggggcctg 
ggcctggact cctgggtctg agggaggagg 
aggaggggct ggggcctgga ctcctgggtc 
tgggtctgag ggaggagggg ctggggcctg 
gacctggact cctaggtctg agggaggagg 

20 aggaggagct gggggcctgg actcctgggt 
ctgggtctga gggaggaggg gttggggcct 
ggacctggac tcctaggtct gagggaggag 
ggaggcgggg ctgggggcct ggactcctgg 
tcctgagcct gagggaggag ggacttggac 

25 tgggggcctg gactcctagg tctgagggag 
gagggaggaa ggtgctaggg tctggactct 
tggacttctg agtgtaagga aggagaggcc 
aggggctggg gttctggacc cctaggtctg 
ggtctgaggg aggaggggct ggggccggta 

30 cctggacccc tgggtctgag tggggagggg 
ccagcaggag aggcccttcc tcgcctggca 
caagatccct tctcctgccc ctgcagatct 
gagaagaagg tgaaagctgg actgggaagt 
tctccaagga gtccctgagt cagaaccctt 

35 accagactgt cggctgaagc ccccgctctt 
cccctccagg ccctgcctcc tgtgctcagg 
cctcctgggt ttctgagtgg gctccttctg 
tgtatttctc cacacatttt catctgtctc 
gtgcagcctc agccttcccc taatgctagc 

40 gcccagggtg acaagattat tgatggcgcc 
gtggccctgc tcagtggcaa tcagctccac 
gtgctcactg ccgcccactg caagatgaag 
caccagcgtc tccagctcgc tatgggggtg 
attctcctct ctctgggtct ctgtcctcct 

45 gggctctgtc cccctctctc cctggctctc 
tctctctgga tctctgttcc cctctctctg 
tgttccccct' cctctctctc tgggtctctg 
tctctctggg tctctgttcc cctctctctg 
tctctccccc ctcctctctc tgtgtctctg 

50 ccctcctctc tctgggtctc tgtccccccc 
gctctgtccc cctctctctc tggatctctg 
cctctctctg tgtctctgtc ccccctcctc 
caggtctttg cacaccctct ctgtcacagt 
cgcagtgagt acaccgtgca cctgggcagt 

55 atcaaggcct cgaagtcatt ccgccacccc 
ctcatgctcg tgaagctcaa tagccaggcc 
ctgccctccc gctgcgaacc ccctggaacc 
acgagcccag atggtaggtg gcctcagtga 
tcagacccag gagtccaggc ccccagcccc 



agcgactgct tgaatcccgc tcctgcccct 
gagcagccag gagggactgt gggaccagaa 
cgtccttcgg agggagggtg gagagagagt 
gggttccaga* taaccagaac cgcaaatgca 
gcaccccagg gcgctggggc cgggcatggg 
agagggatga agattttgga gcccagctgt 
tcagatcctc tcgaggtgag aagaggcttc 
atgctggctc caagcctgac tctgctctcg 
tcctgagcag tgagggcctg gaagtctgca 
cagagtcaga agtttgggtt ccacagggaa 
tctagggatc tgaagaacag gaattttggg 
tcctgggtct gagggaggag ggcctggggg 
ctgggggtct cgactcctgg gtctgaggga 
tgagggagga ggggctggga cctggactcc 
gactcctggg tctgagggag gaggggctgg 
ggctggggcc tggactcctg ggtctgaggg 
tgagggagga ggggctgggg cctggactcc 
gactcctggg cctgagggag gagggactga 
gactgggacc tggactcctg ggtctgaggg 
ctgagggagg aggggctggg gcctggactc 
ggactcctga gcctgaggga ggagggactt 
gagctggggg cctggactcc taggtctgag 
gtctgaggga ggaggggttg gggcctggac 
ctggactcct aggtctgagg gaggaggagc 
gaggggctgg gggcctggac tcctgggtct 
tgggtatgag ggaggaggag gttaggggtc 
agagaaagga atttctgggt ctgagggagg 
agggaggagg ggctggggcc tggacccctg 
ctcctgggtc tgtgggggga ggggctgggg 
ctgggcctga atgctttctc cttctcagct 
gcccctgagc ggctcagcag ggcaccatgg 
tactgctatc cttagccttg gaaactgcag 
ctgacctcac ctcagggccc ccactgaccc 
ccctcctcaa acagcttcca tcctgggagg 
cctgcttctg ctgactcagg gggtctctgt 
gtctctctgt ggttccccag atgagatgcg 
tctgtctcta tccctatctc ttgctttctc 
tgtccatctc tgactctggg aatccctgag 
tacccacgtg ctcctccatg tctccatcca 
ccatgtgcaa gaggctccca cccatggcag 
tgcggaggcg tcctggtcaa tgagcgctgg 
taggtgccgc ccaagtctct gctggaggtg 
gaagggcagt ctttctgtgc ctacggctct 
ctctctgggc ctctgtaccc cctctccctg 
tgtctccctc tctctgggtc tctgtccccc 
tgtctctgtt ccccattctc tctaggtctc 
tccctctctc tctggatctc tgtccccctc 
ggtctctgtc ccctctcctc tctctgtgtc 
tcccccctcc tatctctgtg tctctctccc 
tctctgggtc tctgtctccc tctctctggg 
ttcccctctc tctgggtctc tgtctcccct 
tctctgggtc tctgtcccca ccccgtcccc 
gtctcttctg aatctgtgaa tgtcactcct 
gatacgctgg gcgacaggag agctcagagg 
ggctactcca cacagaccca tgttaatgac 
aggctgtcat ccatggtgaa gaaagtcagg 
acctgtactg tctccggctg gggcactacc 
cccaggagtg caggccccag ccctcctccc 
tcctccctca gacccaggag tccaggcctc 
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agcccctcct ccctcagacc caggagtcca 
gagtccagac cccagcccct cctccctcag 
ccctcggaac caggagcctg aacaacagcc 
tgacagctct ccctgctcct ccctgcagtg 
5 gtcaagctca tctcccccca ggactgcacg 
atgctgtgcg ctggcatccc cgactccaag 
ccaattcctc cccagtcctg ggtaccctgt 
agtgactggg taccaagccc ggccttgccc 
ccacctcatt ctctgcctag gtcaggggtg 

10 gggatgagac agagagttta ataggggtga 
tgaggtgctg gcctcaggcc caaaccttaa 
aaatcataat gcaatattta aaaataaaaa 
aaaatgtcac attttaaata aagagcaggt 
tactagcgtg gctcagcaca gcgctgtact 

15 atgcccatca tgcagtttta tgtattacat 
tgtgtattgc agttactgag attttgtgcc 
gccctggtcc cggggaaaac actctttctc 
cttttgtcat cccctctgtt tctgaacagt 
catttcattc ttttcctctt tgttttttct 

20 tgttctctct ccatgccctc ctctctgctc 
tctctctcct cccctccctc tctcctctcc 
tctctgtctc ctctctggcc ctctcctctt 
tcatctctct ccctcatctc tccttgcccc 
cttctatctc tctcctctcc ctgccgctcc 

25 ■ ttctcctcct ctcttccagt ctctctctcc 
acaccttccc cccctttctc tttgtctctc 
tctctctctc ttctcttccc acaccctccc 
tctccctcct tcttttccac ccccatctct 
tcttcctcat ctctctttgt ctctctctcc 

30 ctctgtctct ccacacccat cctccttgct 
tgggatggtg agtgttaggg atagaggaga 
ggtgttcccc ttctcccctg gtgagggcca 
acagagcccc acactcagaa gggtctcaaa 
gaaattctca ataatttttg aacaaagggc 

35 aaattatgta actggtcttc accctggtct 
cacagggcac gcatccaccc cttggagatg 
gtccttaaca ttggaaaata aagagtgctc 
attggtctca ttggccaagg gtcaaaccag 
actagctctc ccattagtcc ccagagacaa 

40 tccataatct gcaagacaaa gaccgataac 
gatctgaagc caaagttaat ctccggctct 
tggtgtgcag aggtaccctg caaggtctgg 
ccaatgaccc aggagtctac actcaagtgt 
tgaaaaagca tcgctaacgc cacactgagt 

45 cacaggagtg aggacgccga tgacctatga 
atatatttaa accaacctca tgccctgttg 
accaaaacaa ataaagaaac acaaaaccct 
actctcaaac actggaactg gacgttcgta 
acaccgagac ccttattcac cacctttgac 

50 tgaaacaaaa aaaatccaaa atgtagaaca 
acagaaatga agtgaaacca tcaaacatgt 
tctggcttgg cacaacgatg ttttttttct 
aatgcagtga tgcaatcttg gctcactgca 
gtgcttcagc ctcccaagta cctgggacta 

55 tttgtgtatt tttactagag acagggtttc 
tgacctcaga tgatccaccc accttggcct 
accacggcca gcccacaatg atattacaaa 
tcagtattat tcaagaacat ttaggctata 
atatatatgt atgtgaccct acccataaaa 



ggcccccagc ccctcctccc tcagacccgg 
acccagcagt cctgggcccc agaccctcct 
cttctggtcc tcgcccccat cctctctgac 
acctttccct ctgacctcat gtgcgtggat 
aaggtttaca aggacttact ggaaaattcc 
aaaaacgcct gcaatgtgag accctccccc 
ctgcatgccc cagggacaga gcttgacccg 
tccccccagg cctggcctcc tcagcttttt 
ggagtttact taggggccaa tgtggccctg 
gaaagtgggg gtgggaccag ggaaggagac 
gggggcacca aaaacctcag tgattgagat 
taaaaactca tgcagaagtc catgatggac 
ggatcttact gaattttccc * ttgccgtaag 
ggcactgtct tcatttaaaa tgtggatacc 
ttgatttcgt taagtactgc attgaagtat 
tgaagctgat gactcactca cctgaccctg 
tccacctcct ctctgttccc tctttctggc 
cttcccacat ctctctttgt gacataattt 
ctgtgttgag ctagcttgct ctccctccct 
tctgtcttct ccctctttct cttgcttctc 
ctgcccccct gctctctctt ttttcctctc 
tctctctctc ccccacttct ctgtctctct 
ctccttttta ctgtctctct ctttctcttt 
cccatctctg tctttctttc tctctcttta 
tctccccacc cccaccccat ctctctcccc 
tcttctccct ctttcttctc cacccccatc 
catctccctc atctctttgt ctgtctctct 
ctgtctctct ctctccccat accctttccc 
tttccctctt tcttctccac ctccaactct 
cacatctgca ccttcagctg tcaggggatg 
tgggagagag atgactgtcc tagagaatag 
gtttcatgaa tgtgcaagct ctgcacggac 
cttagtctaa tgcattcctg ctgttgtctt 
cctgcatttt cgttttgcac caagtcctgt 
ccgagaccat cgtgtccccc tttcctgcgc 
atgttccttc tcccactagc ttggagcagg 
tgatcctgga agccccaccc cttctctgca 
tgtcttcaaa ggacctagtg tgtccctagc 
tgagtctctt ctcattggct atggtggaag 
tgaggaatgt atgagaatga gttgggcttt 
attccctcta gggtgactca gggggaccgt 
tgtcctgggg aactttccct tgcggccaac 
gcaagttcac caagtggata aatgacacca 
taattaactg tgtgcttcca acagaaaatg 
agtcaaattt gactttacct ttcctcaaag 
ataaaccaat caaattggta aagacctaaa 
cagtgctgga gaagagtcag tgagaccagc 
cagtctttac ggaagacact tggtcaacgt 
ccagtaactc taatcttagg aagaacctac 
agacttgaat ttaccatgat attatttatc 
tccaaaagta ccagatggct taaataatag 
ttgagacaga gtctctgttg cttgggctgc 
acctccgcct cctgggttca agtgattctc 
caggtgtgca ccaccacacc aggctaattt 
accatgttgg ccagcatggt cttgaacgcc 
cccaaagtgc tgggattaca ggcatgagcc 
cctattaaaa atgatactta gacagaattg 
ggatgttaaa tgacaaaagg aaggacaaaa 
aatgaaatat tcacagaatc agatctgaaa 
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acacatgtcc 
tttccttgaa 
tctaagattg 
ttttgagccc 
5 tttttcctgt 
ggagggagga 
agcatctcac 
cagtagacag 
ggaagatggt 

10 gatgagggag 
aagatcctct 
acacgccctc 
tccagatcct 
agccctgtca 

15 taacacagaa 
agaccccccg 
tgtttctggg 
gttttagcaa 
aaaagtaaca 

20 ctgctctcct 
gttgattttt 
ggactagctc 
ttttccttta 
aagcacctac 

25 aaacattgt 



cagactgcat 
tgtgcacttt 
gagagaggtg 
tggtcctgcc 
ctgtaaaatg 
gagaacaggc 
gagtgacaag 
gacacagggg 
ggcacctgct 
gagtcctctg 
gtgtggtcac 
ctacctgttc 
tcccctcctt 
tattgcagaa 
aacgcaggag 
ccccaacccc 
cctgtcaagt 
cattttctct 
aaacattgca 
atacgcaagc 
tttaagtaat 
tggatcttgg 
agctcattac 
tttctagggt 



actggggtcg 
tataacatga 
acctttcagg 
ggccctgttc 
ggagagagag 
caacttcatc 
tgaggaggga 
tcccacgggg 
ccccaagaag 
tgactcagag 
acctcagacg 
ttcctgtttt 
atctcatctc 
attctgcagc 
tccaggcccc 
tcctccctca 
ttaagaatgt 
ctcttctgca 
tttgcactaa 
tacaggtaga 
tcatttttct 
cttcttgggt 
ttcccctgtc 
tattacagag 



tcatgaggtg 
aaaataaagg 
aagggagact 
cagggcatat 
gaaaggatgg 
agcgtgggaa 
ggctggcggt 
gtctgccaga 
ggagggaaag 
cctggccaca 
ctgctgaccg 
tctcccagaa 
cctctgagtc 
cgctaattct 
cagcccctcc 
gacccaggag 
caaacatttt 
aggcactcca 
gtcagcctgg 
ttggtttgca 
ttgggtaagc 
tcaaatccca 
cctgttcctt 
attcaataag 



tctccttcct 
tggggaaaaa 
agaaagaaat 
ttccatttcc 
agagaggaag 

ggggtgtgaa 

tttcagaggg 
agtaagcaaa 
gaacctcggg 
gccccagcca 
aggagccact 
ttccctcccc 
tctcctaacc 
gattctccca 
ttcctcagac 
cccaggtccc 
cgaccagtca 
acattcaatc 
agatccctgg 
atgactgaga 
agtatagtgt 
gttctagtcc 
catccttgaa 
ttaatataca 



tctgtgtact 

agtctgaaga 

atgtgcctgg 

cagatctcag 

aaggaaggga 

agtgtttctg 

attgggatga 

cagtgccgga 

aagcgggtag 

tctaacatca 

ccagcccagg 

accaagatcc 

caggcaccac 1 

tataggaggc 

ccaggagtcc 

cagccccttc 

ttcccctgaa 

tggaatttta 

ccctggccct 

tggtactaat 

Vjgtagttaag 

ctacaagcta 

atgggagaaa 

gaaagtgctc 



SEQ ID NO. 13 

Hk8 amino acid 



30 



35 



MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQ 
PWQAALFQGQQLLCGGVLVGGNWVXTAAHCKKPKYTVRLGDHSLQNKDGPEQE I PVVQ 
SIPHPCYNSSDVEDHNHDLMLLQLRDQASLGSKVKPISLADHCTQPGQKCTVSGWGTV 
TSPRENFPDTLNCAEVKIFPQKKCEDAYPGQITDGMVCAGSSKGADTCQGDSGGPLVC 
DGALQGITSWGSDPCGRSDKPGVYTNICRYLDWIKKIIGSKG 



SEQ ID NO. 14 
KLK8 nucleic . acid 

40 CDS 35. .817 



gtgaccccgc 
caagacgtgg 
ggaggacaag 

45 cttgttccag 
tacagctgcc 
gaataaagat 
caacagcagc 
ggcatccctg 

50 ccagaagtgc 
cactctcaac 
ggggcagatc 
gggcgattct 
ctcagacccc 

55 ggactggatc 
ccttaataaa 



ccctggattc 
atgttcctgc 
gtgctggggg 
ggccagcaac 
cactgtaaaa 
ggcccagagc 
gatgtggagg 
gggtccaaag 
accgtctcag 
tgtgcagaag 
acagatggca 
ggaggccccc 
tgtgggaggt 
aagaagatca 
ctcacaactc 



tggaagacct 
tcttgctggg 
gtcatgagtg 
tactctgtgg 
aaccgaaata 
aagaaatacc 
accacaacca 
tgaagcccat 
gctggggcac 
taaaaatctt 
tggtctgtgc 
tggtgtgtga 
ccgacaaacc 
taggcagcaa 
tctggttc 



caccatggga 
gggagcctgg 
ccaaccccat 
cggtgtcctt 
cacagtacgc 
tgtggttcag 
tgatctgatg 
cagcctggca 
tgtcaccagt 
tccccagaag 
aggcagcagc 
tggtgcactc 
tggcgtctat 
gggctgattc 



cgcccccgac 
gcaggacact 
tcgcagcctt 
gtaggtggca 
ctgggagacc 
tccatcccac 
cttcttcaac 
gatcattgca 
ccccgagaga 
aagtgtgagg 
aaaggggctg 
cagggcatca 
accaacatct 
taggataagc 



ctcgtgcggc 
ccagggcaca 
ggcaggcggc 
actgggtcct 
acagcctaca 
acccctgcta 
tgcgtgacca 
cccagcctgg 
attttcctga 
atgcttaccc 
acacgtgcca 
catcctgggg 
gccgctacct 
actagatctc 
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SEQ ID NO. 15 

KLK8 nucleic acid 

CDS join (<1. .39, 
Exon <1. .39 
Exon 418. .712 
Exon 878. .946 



418. .712, 878..>946) 



10 tcttcggttc 
ttctcctttt 
atccaggccc 
cctccctcag 
gtcccagtcc 

15 cccaggaatc 
ctttatgttt 
tctgagattc 
actcacaccc 
aggaggagca 

20 gaactgaagt 
attctggaag 
ctccagattc 
ccccggcttc 
acccagacat 

25 tccttcctcc 
aaatccagaa 
tcccagacct 
tcttgctggg 
tctgaaggca 

30 gtgtgcacgg 
tggacagagg 
gtctgaatag 
actcggggcc 
gtatgatgac 

35 ctcctcacta 
tcccagccct 
cactccaggg 
ccttggcagg 
ggcaactggg 

40 ggtcagctgg 
aggactggac 
ccaaactatc 
aataaagatg 
aacagcagcg 

45 gcatccctgg 
cagaagtgca 
cactaatggg 
cttggtcccc 
aggagtcata 

50 caactagcag 
cctatgggac 
gacttctcag 
tttttttttt 
gagtgcagtg 

55 cgaacctcag 
ttgtatttgt 
cctcaagtga 
gcccctggcc 



ccggttactg 
aaaagtactt 
ctagcccctc 
acccaggagt 
ctcctccctt 
caggccccca 
cagatcgtag 
cagcccctcc 
agtgttccag 
ggaggtggag 
actggggcct 
gtgaggtgca 
caggacccca 
ctcctctccc 
cgaggactcc 
agagaagccc 
ctgggagctc 
caccatggga 
gggagcctgg 
gcagtggcgc 
atgtgaagag 
caagaaacag 
ctacagcctt 
tgtagactca 
acatggactg 
agttgtatgc 
cgcactgccc 
cacaggagga 
cggccttgtt 
tccttacagc 
ggcttaagga 
tcctgggtct 
cccaccatta 
gcccagagca 
atgtggagga 
ggtccaaagt 
ccgtctcagg 
agggagagga 
cagagggaga 
tgagaagcct 
attgttaagc 
ctttggttgt 
aggcccagga 
tttttttttt 
gcacaatctc 
cctcccgagt 
tgtagagaca 
tctgcccgcc 
tttttctcct 



gcagcagccc 
agaattcagc 
ctccctcaga 
ctgggccccc 
agatttagga 
gcctcctccc 
agtctcagca 
tccctcaaga 
ttcccagaag 
attcccagtt 
cctccactgg 
gaggtactca 
gcctcagatg 
taatttagga 
cccctccctt 
agaacagccc 
aggctcctcc 
cgcccccgac 
gcaggtgagg 
tggggagtct 
tgcgatacgg 
gtagcagcag 
tgggggctgc 
tgtggaagca 
aaatgagtgt 
ggagaacttg 
cagagggtgg 
caaggtgctg 
ccagggccag 
tgcccactgt 
aagagggggc 
gaaggaggag 
caggaaatac 
agaaatacct 
ccacaaccat 
gaagcccatc 
ctggggcact 
ggagctggtt 
caaagaaggg 
cgaagatctg 
agggccagaa 
ataggatttt 
aaacaggctc 
ttttgcaggg 
agctcactac 
agctgggatt 
gggtttcgcc 
tcggcctacc 
tcttgaaccc 



cctcctccca 
ccccagctct 
cccaggagtc 
agcccctcct 
gtccaggccc 
ctctcagaac 
ccgagtccct 
tttcacgttc 
ctccccaggc 
aaaaggctcc 
gtccgaatca 
gatacagaca 
cccttctctg 
gtcccagctc 
ggaatgtagg 
cagatactct 
ttcctgttta 
ctcgtgcggc 
agggttgcgg 
gtgggaatgc 
tgcaggagcc 
gtaggagtag 
ttgcttgggg 
tgtgggggca 
ccccgtgtgg 
ccgtgtgtcc 
cgatccaacc 
gggggtcatg 
caactactct 
aaaaaaccgt 
tggggtttcg 
ggggctgggg 
acagtacgcc 
gtggttcagt 
gatctgatgc 
agcctggcag 
gtcaccagtc 
ggcccagtgg 
aaagtgatca 
actactaaca 
aagcccatcc 
atataatctt 
ctaagcacct 
ggacacagtt 
aatctctgcc 
acaggcaccc 
atgttgacca 
aaagtgctgg 
aggaaacctg 



caaaagatca 
ctcctccctc 
ctggccccta 
cggtcagacc 
ccagcctctc 
taaaatcttg 
cctctcccta 
aatcccctcc 
tctagtgcag 
agaatcgtgt 
gtaggtgacc 
tcaggccccg 
tcgagatcca 
ccagctccct 
aatccagtcc 
cggctgcctc 
ccggccccgc 
caagacgtgg 
aggcctccgg 
cgcgggggtt 
tctgtgggct 
gttccgtgat 
gcatagattc 
ttcttgggtg 
cagcgtgtgg 
atttgaaccc 
ctctccctcc 
agtgccaacc 
gtggcggtgt 
gagtggatga 
actcaggaag 
gcaatacccc 
tgggagacca 
ccatcccaca 
ttcttcaact 
atcattgcac 
cccgaggtag 
aacccaagct 
tgatgttgag 
agagtggtga 
tgtatggcgg 
aacacacttg 
tctcccccac 
tcactcaatc 
tcctgggttt 
gccaccacgc 
ggctggtctc 
gattacaagc 
ggccctggtc 



ggttccaagc 
acacccagga 
gcagccccct 
taaatcccag 
ctccctcaga 
gcccccagcc 
gcctcaggag 
gccccttctc 
gaggagaagg 
accaggcaga 
ccgcccctgg 
gaccctcctt 
gcagtctgga 
gtcccctcag 
cccagcctcc 
cccagtgccc 
cctctccatt 
atgttcctgc 
aggggaggga 
atgtgggtgc 
ttcctcaggg 
gctgtaaatt 
acctgggagt 
tgtgactctt 
aagcctggac 
acagtggcct 
tgctgcagga 
ccattcgcag 
ccttgtaggt 
tgggggcaga 
gagagagctg 
tgcctgggtc 
cagcctacag 
cccctgctac 
gcgtgaccag 
ccagcctggc 
tgggcttgtc 
attggcaaag 
attcacaagc 
gagaaagaac 
agagcactac 
ggatattttg 
ctccctcctt 
gcccaggctg 
aagcgattct 
gagctaattt 
gaactcctga 
atgagccacc 
acctcttccc 



WO 2004/075713 



40/47 



PCT/CA2004/000281 



tcagacccgg gagtctaggc ccttcttctc 

actctggacc ccaaaatctg gacctccaat 

gagagcccct cgccttcctt cacagtgaaa 

gcttctccag gaagtggcaa agtcagcatt 

5 aatgctttgg atgagacagt ttgcggctgt 

tcagttcaaa ctatggctct gccctttctc 

tcactttcct tatctgcaaa atgggaatca 

gttcaggtta catcagtttt cttcattgtg 

catggtagat gctcaataaa tatttgttga 

10 gaatgaatgc atctctcctt caaagcgctg 

ttgcttttgg cacgtagaga tgccagatgg 

aagtctattc ctcaaaagag gttaataaaa 

aactcttatt tactgagcac ctagtatgcc 

gccatgagca aactgtacaa agtccttgtc 

15 tagacaataa acaaatacac ataaaataaa 

gtggagcgta aaaggatagg gagtaatgga 

ggaaggcttc caggaggagg tgacacttgg 

ccggttgtct gagggcagga agactggaaa 

ctccaagtaa cccccagccc cttgatttga 

20 gatccaaggg cccctctcta gatggagaaa 

atgtggtgag aagtgatgtc ccctgtgcac 

aaactccccc attacatcct ggaaagaaag 

agggctgctc tctcagggaa ccttacaacc 

ctcaactgtg cagaagtaaa aatctttccc 

25 cagatcacag atggcatggt ctgtgcaggc 

agcaatttct gaaatccttc tcctcacaca 

tggatggggg tgggggtggt aagggagtga 

ccctatctct acggattggg agccaggttc 

cactgcatag ggattaggag cattgaattg 

30 acaggatggg atgcagagtt gggggctata 

agtctgaggc tagagtttag acgagttgcc 

aatttgtgcc gactttacca ccggcccggt 

gaatcccaag gctggttccc ttctcctgtg 

ccagttggct ggagtaaatg gtgccattga 

35 ctttctcttg ggatttgtca tagatataac 

tagcactttc ccctggcttt gtccacttcc 

gtccctaaat acaatgcttt cttccccatt 

tactgccatg aaagccagtc tctggattgt 

atttcagcca tggtctcttg gggatatggt 

40 gtgtgtgtgt gtgtccataa gaatcttgat 

atgatgcctt cactattggc caggaacaga 

tccctactgt cttctccctg gattggaccc 

cagggtctcg ctctgtcccc caagctggaa 

ctctgcctcc cagttcaagc aattctccca 

45 gtgtgcacca ccacacccag ctagtttttg 

ttggccaggc tggtctggaa cgcctgactt 

tgccgggatt gcaggtgtga gccaccaagc 

tgtcagtgtt ttgaccacta cccttagaca 

ccctgtcctg actggtctca ggggacaatg 

50 tcatgccaaa gaactggtaa aacgctggga 

agatgcaagg ctgttctcat tctccctgtc 

gatggtgcac tccagggcat cacatcctgg 

cctggcgtct ataccaacat ctgccgctac 

aagggctgat tctaggataa gcactagatc 

55 cttgcctgtc tctgttttgg ccctgtgggg 

gctggcagac taggtgggga tgtgcagaaa 

cactgcctaa agtgatccct cgttttctgg 

gagaaaatac atacaaaggt cttcaaacat 

gaaagagttt gtgtttagtt ttcctgagca 



ccaggaccca gcagtcctag ctccctcttg 
gaagctgtcc ctttgggact ccagaatcca 
acattgggac tcacctaaga gagtcaagga 
caggtccctg cctgcctcac tcctgctctg 
ggaaacacac gtgctcgcaa atcaagtaga 
cctgggcaaa ttcctttcca tctctgagcc 
ttaaccaata gattttttag ctacgtgagg 
ttctcctagt gctcagaatg gtgcctgcta 
atggatgacc tgatgaataa ataatgaaat 
ttgtgaggat taaatgagat tatgagcata 
aagtttttcc ccctcaaaga ggctttggag 
aagatcaatt ccaccttcaa tcattaattc 
tgaggtgctg ttgcaggcgc tggatataca 
gttatggagt tgcaagctag gtgggagaaa 
acattaggca aagtgctgta aggaaatatt 
gggatggtat tttttaattc gagtggtcag 
aaggagcagg atttagcaca gattgaggac 
gagagggtca gaggtatcat aaaggggcac 
aaatggctca gggtaagaaa aaacacgtga 
gcccaatagc aacaagtaca gcttcgttta 
agtgtcagaa aattccccca tgcagctgga 
ggggttagat ccagacaggg atggaggcaa 
tcttccccct cagagaattt tcctgacact 
cagaagaagt gtgaggatgc ttacccgggg 
agcagcaaag gggctgacac gtgccaggtg 
tccctcattg ccctctcgag gttcaaggct 
ccccaaagac ttggcacccg gagtgttcac 
agagaagcca aactctctct ctgaaagtca 
ctgctgctgt tttctttcga acgcttaact 
gagggtgggg tagatgtcca gcaaggagag 
tgctctctgg ttcccagcat tacatcgtgg 
ggtcagccac ttttctttga gaccgggcta 
attggttcct tgggagacaa tggtgtcttc 
cttcagtgct tcctcaatga gctccaatcc 
tctctcttct aactgcaact tcgttgttcc 
tggaagcccc accacctttg ccaatgactg 
ggccaaaaat ggagtcgttt ccatcagcga 
tctgtagaga tagtggtctc ttccacaaat 
gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 
cctttctcct atgtttggca actgcaatca 
ggaaacttca gctcagtcct ctccaagtaa 
tcgagaactc tttttttttt ttttttgaga 
tgcagtggca caatcttggc tcactgcagc 
cctcagcctc ccgagtagct gtgattacag 
tatttttagt agagacaggg attcaccatg 
caagtgatct accgcctcgg cctcccaaag 
ctgtctggga tttcattctt tcccctcttc 
ccatgtctgt ctgtacatgg aagccccaag 
cttttacccc cattggctac aggggaccaa 
cagcaggaaa agggacgttg tggacatctc 
tagggcgatt ctggaggccc cctggtgtgt 
ggctcagacc cctgtgggag gtccgacaaa 
ctggactgga tcaagaagat cataggcagc 
tcccttaata aactcacaac tctctggttc 
agggctggat ggggatccgg gattgttcct 
ccaagttctc atggtcactt tggccatcac 
aagaacttgg gtaagagctt tatttcaggg 
tcagtgggct ggtatgtgaa agacagtttt 
aagcatttac aagctttgga gataaaattt 
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tcccccttta aaaaataatt 



SEQ ID NO. 16 
5 HklO amino acid 

MRAPHLHLSAASGARALAKLLPLLMAQLWAAEAALLPQNDTRLD 
PEAYGAPCARGSQPWQVSLFNGLSFHCAGVLVDQSOTLTAAHCGNKPLWARVGDDHLL 
LLQGEQLRRTTRSVVHPKYHQGSGPILPRRTDEHDLMLLKLARPVVPGPRVRALQLPY 
10 RCAQPGDQCQVAGWGTTAARRVKYNKGLTCSSITILSPKECEVFYPGWTNNMICAGL 
DRGQDPCQSDSGGPLVCDETLQGILSWGVYPCGSAQHPAVYTQICKYMSWINKVIRSN 



SEQ ID NO. 17 

15 KLK10 nucleic acid 

Gene 1...1580 
CDS 220...1050 



20 catcctgcca cccctagcct tgctggggac gtgaaccctc tccccgcgcc tgggaagcct 

tcttggcacc gggacccgga gaatccccac ggaagccagt tccaaaaggg atgaaaaggg 

ggcgtttcgg gcactgggag aagcctgtat tccagggccc ctcccagagc aggaatctgg 

gacccaggag tgccagcctc acccacgcag atcctggcca tgagagctcc gcacctccac 

ctctccgccg cctctggcgc ccgggctctg gcgaagctgc tgccgctgct gatggcgcaa 

25 ctctgggccg cagaggcggc gctgctcccc caaaacgaca cgcgcttgga ccccgaagcc 

tatggctccc cgtgcgcgcg cggctcgcag ccctggcagg tctcgctctt caacggcctc 

tcgttccact gcgcgggtgt cctggtggac cagagttggg tgctgacggc cgcgcactgc 

ggaaacaagc cactgtgggc tcgagtaggg gatgaccacc tgctgcttct tcagggagag 

cagctccgcc ggaccactcg ctctgttgtc catcccaagt accaccaggg ctcaggcccc 

30 atcctgccaa ggcgaacgga tgagcacgat ctcatgttgc tgaagctggc caggcccgta 

gtgctggggc cccgcgtccg ggccctgcag cttccctacc gctgtgctca gcccggagac 

cagtgccagg ttgctggctg gggcaccacg gccgcccgga gagtgaagta caacaagggc 

ctgacctgct ccagcatcac tatcctgagc cctaaagagt gtgaggtctt ctaccctggc 

gtggtcacca acaacatgat atgtgctgga ctggaccggg gccaggaccc ttgccagagt 

35 gactctggag gccccctggt ctgtgacgag accctccaag gcatcctctc gtggggtgtt 

tacccctgtg gctctgccca gcatccagct gtctacaccc agatctgcaa atacatgtcc 

tggatcaata aagtcatacg ctccaactga tccagatgct acgctccagc tgatccagat 

gttatgctcc tgctgatcca gatgcccaga ggctccatcg tccatcctct tcctccccag 

tcggctgaac tctccccttg tctgcactgt tcaaacctct gccgccctcc acacctctaa 

40 acatctcccc tctcacctca ttcccccacc tatccccatt ctctgcctgt actgaagctg 

aaatgcagga agtggtggca aaggtttatt ccagagaagc caggaagccg gtcatcaccc 

agcctctgag agcagttact ggggtcaccc aacctgactt cctctgccac tccctgctgt 

gtgactttgg gcaagccaag tgccctctct gaacctcagt ttcctcatct gcaaaatggg 

aacaatgacg tgcctacctc ttagacatgt tgtgaggaga ctatgatata acatgtgtat 

45 gtaaatcttc atggtgattg tcatgtaagg cttaacacag tgggtggtga gttctgacta 

aaggttacct gttgtcgtga 

SEQ ID NO. 18 
KLK10 nucleic acid 

50 

Gene 1. .5574 

mRNA join (48. .120,605. .701,2455. . 2635, 3589 3863, 4195 . .4328, 4793.-5474) 
CDS join (614. . 701 , 2455 . . 2635, 3589 . . 3863, 4195 . .4328,4793. .4945) 
Promoter 1...47 
55 5'UTR join(48. .120,605. .613) 
exon 48...120 
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exon 605...701 

ttggggtcaa aagggaaggt cccgccaggg 
cacaggcagg gcagaggcac gtctgggtcc 
5 tgaagctacc tgcaccccac ccgggttggg 
cgggccacag gcaggagccg gcttagagcc 
cccccgctgt gaggttgaga aagaggctcc 
gcgcgtgaag gagggggaaa cctcgggcgc 
cggagagcgc tccactcggg cacagggagg 

10 ggtaattgtg cccattaccg tgttccagcc 
atcggctgca ggtcaaaagg gtgcttgcgg 
gcggcgaggt gggggtctct aacggagcat 
gcagatcctg gccatgagag ctccgcacct 
tctggcgaag ctgctgccgc tgctgatggc 

15 gcggggagag gcgccggtgg gaggcacggg 
cccgcgcacc tctcctcccc cgagacccca 
gcccccaagt cagctgggcc cttcttctgg 
gccctgtccg atttctgcca cccggatcct 
ttctcctcct cttacgccgc cccatcccgg 

20 ttttgtttgt ttgtttgttt gagacggagt 
ggcgcgatct cggctcactg caagctccgc 
gcctcccgag tagctgggac tacaggcgcc 
tttagtagag atgggatttc accgtggtct 
cggcctccca aagtgctggg attacaagcg 

25 ttttatcctg agtcttgcct taccgctttt 
tttccctttc tctctctttt ttctttttct 
tccttccttc ctttccttcc ttcctttcgt 
cttatttttt cttctttctt tcttgttttt 
tttttttttt tttttttgag acagggcagt 

30 ccaatcagag ctcactgcag cctcgacctc 
tagctggtac cacaggcatg caccaccaca 
tttttttgag acagggtctc actctgtcgc 
ctcattgcaa cctccacctc ctgggctcaa 
ctgtgactac aggcgcatgc ctccgcgact 

35 ttttgtagag actgggtctt gctgtgttgc 
cggtccaacc gcctcggctt cccaaagtgc 
acccctctcc tggttttcaa tcccgttttg 
cgagttctat ccccgcaccc ttacctcccc 
cagcgacgtt ctagccagct ctccaggcgc 

40 ccctattctg tctcattact ccacgctacc 
ttcttctcct cccattcctt cggtgcacgc 
cgttctggct gcgtcccccg tccccgaacg 
caaacccaga ccctgccctc cgtgctttga 
aggcggcgct gctcccccaa aacgacacgc 

45 gcgcgcgcgg ctcgcagccc tggcaggtct 
cgggtgtcct ggtggaccag agttgggtgc 
aggagatcca tccccgagga cgccacgggg 
ggcagggcgg ggtctctcgg aactcccaca 
gcaggaggag agaccgggct tgcgctgtgg 

50 atttctgggc ccgtcactcc tctcccgctg 
gcctgcaacg gatagaagcc agggtccggt 
gttcacttcg tgagtccccg ttgaaggagg 
ggagttgggc tgggctcggt ggctcacgcc 
gggtggatca cctgaggtca ggagttcaag 

55 ctcaactaaa aataaaataa aataaaaaat 
aaggctgggt gtggtggcgg gtgcctgtaa 
ggagaactgc ttgaacctgg gaggtggagg 
tctagcctgg gtgacaagag cgaaactctg 
aaatgtggaa ggcttaccta ggtgtccagg 



gtccctgggc agaggatacc agcggcagac 
cctccctcct tcctatcggc gactcccagg 
gtggattgcg agaggatggg tgggaacccc 
tcggtttctc cactgcggga cgcggaagtc 
cactggctcc gagcctcgga tccccacccc 
ggcgtggctg cagccagggt agctggggcg 
acgggaagat gccgcgaggg gcgtcattag 
cgaatctccc gtctgccagc ccctggggtt 
gctggggtgg cccaggctgg gcggatgagc 
ctgttttaac ccgcccctgc acacacccca 
ccacctctcc gccgcctctg gcgcccgggc 
gcaactctgg ggtaaggtgg gggacagggg 
cgggagggca atgtgttccc gtcaccaagc 
gcacccaccc agcgctccgg agcacggccc 
ctcggcccct gggtgacccg ccccactcag 
cgctcttccg tggactcttc ggcgtgttct 
ccccgctcca ttttacacta ccgtgttttg 
ctcgctctgt cgcccaggct ggagtgcagt 
ctcccgggtt cacaccattc tcctgcctca 
cgccaccaag cccggctagt tttttgtatt 
cgatctcctg gcctcgtgat ccgcctgcct 
tgagccaccg ctcccggcca caccacagtg 
tgccctctcc cctcactttt tttcttcctc 
ttctttcttt tctttctttc gttcttcttt 
ttgtttcttc tttcttgttt tactttctct 
tttctttctc tctctttttc tttccttctt 
gctctgtctc cgtggctgga gtacagtggc 
ctgggctcaa gcgatactca gcctccagag 
tccggctttt tttttttttt tttttttttt 
ccagactgga gtgcagtggc ccaatctcgg 
gcgatcctcc cacctcagcc tcccaggtag 
acttttttgt tgttgttgtt gtttgtttgt 
ccgggctggt cttgaattcc cgagctcaag 
tgggattaca ggcgtaaacc actgcgcccc 
ttattcacac cccttcctct ccccgatccc 
gccgcgttca atccccgccc ctctatcgac 
gctgcgttca gtccctgccc tccagaccca 
ctatcccagc ttccttccac tttcacgcgc 
gaaaccccca atatttccct accaccctcg 
cagtccagtg ccacagccca gctccaaccc 
ttccgtcccc tttctttctc ccagccgcag 
gcttggaccc cgaagcctat ggcgccccgt 
cgctcttcaa cggcctctcg ttccactgcg 
tgacggccgc gcactgcgga aacaagtagg 
ggctgtggag gcggcctcca gggaggagcg 
gctggaggtg cggtccccgg tgtccttcca 
ccgccagggg acggtgtggt ccttttccgc 
attctccttg agctctagga ggaggtggtt 
gtggtcagga atttagaact aaaggaaaag 
aagggttggg tattaccaca gagaaaatgt 
tgtaatccca gcactttggg aggccaaggc 
accagcctgg ccaacatggt gaaaccactt 
aaaaaatttt aaaaaaatta aaaaaaaaaa 
tcccagctac tcaggaagga ggctgaggca 
ttgcagtgag ccgagattgc gccattgcat 
tttcaacaaa gaaaagaaaa agagaaaaga 
cccccagccc tcctcaattc ctgcagatcc 
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tcagagctca aacaactgat tcctcctccc catgtccact gaggtcccct tctcccacaa 
ggccctcttc cctcagactc ttcctatctc caggccctgc ttcactgccc acctgctttc 
ccagtccctg tgaagggttt gccttcacat gcctcttcct tcccccaggc cactgtgggc 
tcgagtaggg gatgaccacc tgctgcttct tcagggcgag cagctccgcc ggacgactcg 
5 ctctgttgtc catcccaagt accaccaggg ctcaggcccc atcctgccaa ggcgaacgga 
tgagcacgat ctcatgttgc taaagctggc caggcccgta gtgccggggc cccgcgtccg 
ggccctgcag cttccctacc gctgtgctca gcccggagac cagtgccagg ttgctggctg 
gggcaccacg gccgcccgga gaggcaagag ctggggctct gaggccagaa cctcaggagg 
agggggctga gggcctgaac ccctgggtct gaggaaggat gggctgggga ctggattcct 

10 ggatctgagg gaggacgggc tggggtccta gatgcctggg tctgtgagtc tgaggggagg 
aggggctggg ggcctggact cctgggtcta agtggggagg ggctggggcc aggattcttg 
agtctgaagg aggaggggct ggggcttagg atagaaacgg tcttgtatct ggactcctgg 
ctccccaagg attgggggct ggacccaggg attactggca tattctccct tcagtgaagt 
acaacaaggg cctgacctgc tccagcatca ctatcctgag ccctaaagag tgtgaggtct 

15 tctaccctgg cgtggtcacc aacaacatga tatgtgctgg actggaccgg ggccaggacc 
cttgccaggt agggtctgaa cagggagagt ctctgactcc tgggagggag gacagggagg 
ttatgggaaa agagcagacc ctgtgcccga tcccaaactc cattcccaaa cccatccttg 
accccaactc ttacccagac ctaaccccct cctcatccct atcctcaatc ccatttccat 
cctaacccca ccccattccc atctccaagc ccattttcat cccctcacct tccatgaact 

20 acaatcccaa cccaagtctc actgtgcctt cattctcatc ccccagccca acctcccata 
acctgaagtc cacctccatt cctaccttcc agctcatacc taattccaac cccatcccat 
cctcgtcttt atcccaaccc aaccccttcc ttccccacca ctgccccaga tcccaaagtg 
acagctctca cgttggcaca tttatttgat ctctcctttc tgccaccccc agagtgactc 
tggaggcccc ctggtctgtg acgagaccct ccaaggcatc ctctcgtggg gtgtttaccc 

25 ctgtggctct gcccagcatc cagctgtcta cacccagatc tgcaaataca tgtcctggat 
caataaagtc atacgctcca actgatccag atgctacgct ccagctgatc cagatgttat 
gctcctgctg atccagatgc ccagaggctc catcgtccat cctcttcctc cccagtcggc 
tgaactctcc ccttgtctgc actgttcaaa cctctgccgc cctccacacc tctaaacatc 
tcccctctca cctcattccc ccacctatcc ccattctctg cctgtactga agctgaaatg 

30 caggaagtgg tggcaaaggt ttattccaga gaagccagga agccggtcat cacccagcct 
ctgagagcag ttactggggt cacccaacct gacttcctct gccactcccc gctgtgtgac 
tttgggcaag ccaagtgccc tctctgaacc tcagtttcct catctgcaaa atgggaacaa 
tgacgtgcct acctcttaga catgttgtga ggagactatg atataacatg tgtatgtaaa 
tcttcatgtg attgtcatgt aaggcttaac acagtgggtg gtgagttctg actaaaggtt 

35 acctgttgtc gtgatctgac cacgtcccgg tgaaagcgtg tgtccaggga agaagtgcac 
agggtagccc ccaatcccaa ccttccatcc ccaaccctta gggatgatgg aaga 



SEQ ID NO. 19 

40 Hkll amino acid 

MQRLRWLRDWKSSGRGLTAAKEPGARSSPLQAMRILQLILLAIA 
TGLVGGETRIIKGFECKPHSQPWQAALFEKTRLLCGATLIAPRWLLTAAHCLKPRYIV 
HLGQHNLQKEEGCEQTRTATESFPHPGFNNSLPNKDHRNDIMLVKMASPVSITWAVRP 
45 LTLSSRCVTAGTSCLISGWGSTSSPQLRLPHTLRCANITIIEHQKCENAYPGNITDTM 

VCASVQEGGKDSCQGDSGGPLVCNQSLQGIISWGQDPCAITRKPGVYTKVCKYVDWIQ ETMKNN 

SEQ ID NO. 20 
Hkll amino acid 

50 

MRILQLILLALATGLVGGETRIIKGFECKPHSQPWQAALFEKTR 
LLCGATLIAPRWLLTAAHCLKPRYIVHLGQHNLQKEEGCEQTRTATESFPHPGFNNSL 
PNKDHRNDIMLVKMASPVSITWAVRPLTLSSRCVTAGTSCLISGWGSTSSPQLRLPHT 
LRCANITIIEHQKCENAYPGNITDTMVCASVQEGGKDSCQGDSGGPLVCNQSLQGIIS 
55 WGQDPCAI TRKPGVYTKVCKYV DW IQETMKNN 
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SEQ ID NO. 21 
KLK11 nucleic acid 

aggaatctgc gctcgggttc cgcagatgca gaggttgagg tggctgcggg actggaagtc 61 

atcgggcaga ggtctcacag cagccaagga acctggggcc cgctcctccc ccctccaggc 121 

5 catgaggatt ctgcagttaa tcctgcttgc tctggcaaca gggcttgtag ggggagagac 181 

caggatcatc aaggggttcg agtgcaagcc tcactcccag ccctggcagg cagccctgtt 241 

cgagaagacg cggctactct gtggggcgac gctcatcgcc cccagatggc tcctgacagc 301 

agcccactgc ctcaagcccc gctacatagt tcacctgggg cagcacaacc tccagaagga 361 

ggagggctgt gagcagaccc ggacagccac tgagtccttc ccccaccccg gcttcaacaa 421 

10 cagcctcccc aacaaagacc accgcaatga catcatgctg gtgaagatgg catcgccagt 481 

ctccatcacc tgggctgtgc gacccctcac cctctcctca cgctgtgtca ctgctggcac 541 

cagctgcctc atttccggct ggggcagcac gtccagcccc cagttacgcc tgcctcacac 601 

cttgcgatgc gccaacatca ccatcattga gcaccagaag tgtgagaacg cctaccccgg 661 

caacatcaca gacaccatgg tgtgtgccag cgtgcaggaa gggggcaagg actcctgcca 721 

15 gggtgactcc gggggccctc tggtctgtaa ccagtctctt caaggcatta tctcctgggg 781 

ccaggatccg tgtgcgatca cccgaaagcc tggtgtctac acgaaagtct gcaaatatgt 841 

ggactggatc caggagacga tgaagaacaa ttagactgga cccacccacc acagcccatc 901 

accctccatt tccacttggt gtttggttcc tgttcactct gttaataaga aaccctaagc 961 

caagaccctc tgcgaacatt ctttgggcct cctggactac aggagatgct gtcacttaat 1021 

20 aatcaacctg gggttcgaaa tcagtgagac ctggattcaa attctgcctt gaaatattgt 1081 

gactctggga atgacaacac ctggtttgtt ctctgttgta tccccagccc caaagacagc 1141 
tcctggccat atatcaaggt ttcaataaat atttgctaaa tgagtg 

SEQ ID NO. 22 
25 KLK11 nucleic acid 

gene 2313.. 7622 

mRNA join (2313. .2398,4189. .4263,5061. .5217,5545. .5810, 
30 6627. .6763,7158.-7622) 

CDS join (4224. .4263,5061. .5217,5545. .5810, 6627. .67 63, 7158. .7310) 



35 tgataatagt gttctctctc ctcattggtc agggccccag ccattgtcct tgagagaatg 61 

ctcgactctt tatgttgtct tgacagcctc ccctgagatt ggtcattaat gactgtgctc 121 

tctctcctca ttggtcaggg ccccagccat tgtccttgag agaacctctg tcctttatgg 181 

agttccaccc ttcttccctg ggattggccc ctagagacag tggttcttct cttttggtta 241 

gccattgcca ttgtcctccg ggaaagtgat tatactcttt tgtctaatga ccagacttgg 301 

40 agccctcccc aaggcccagg actgggttga agggttgggg aggaaaacag aaataagatg 361 

tctcccttgt tcagacagta cttctcttcc cttccagggt gattctgggg gccccctggt 421 

gtgtggggga gtccttcaag gtctggtgtc ctgggggtct gtggggccct gtggacaaga 481 

tggcatccct ggagtctaca cctatatttg caagtatgtg gactggatcc ggatgatcat 541 

gaggaacaac tgacctgttt cctccacctc cacccccacc ccttaacttg ggtacccctc 601 

45 tggccctcag agcaccaata tctcctccat cacttcccct agctccactc ttgttggcct 661 

gggaacttct tggaacttta actcctgcca gcccttctaa gacccacgag cggggtgaga 721 

gaagtgtgca atagtctgga ataaatataa atgaaggagg ggccatgtct gtccatttga 781 

agtcctcatg ctggttgaga ctggaagaag gactcagcag tttccctatc tcataggagt 841 

agaaacagag ctcaaataag gccaggcaca gtggctcaca cctgtaatcc catcactttg 901 

50 ggaagctgag gcaggtggat cacctgaggt caggaactcg ggaccagcct ggtcaacata 961 

gtgaaacccc aactctacta aaaatgcaaa aattagccag gcatggtggc gcatgcctgt 1021 

aatcccagct actcaggagg ctgagacagg agaatagcat gaacccgtga ggcagaggct 1081 

gcagcgagcc gagattgaac cattacactc cagcctgggc gacagagcga gactccatct 1141 

caaaaacaaa caaacaaaaa acccagtgct caaataggat gagggtcttc cctgagtagt 1201 

55 tactcagaaa tggagtagaa aaagttactt ttaataatat aggccgggtg cagtggccca 1261 

cgcctgtaat cccagcactt tgggaggccg aggtgggagg atggcttgag ctcagatttc 1321 

gagatcagcc tggcaacaca gtgaaatctt gtcactacaa aaacacaaaa aattagctgg 1381 
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gtgtggtggt gcgtgcctgt agtcccagct 
gagccgggga ggtggaggct gcaaagagcc 
aataaagtga gaccttgtct caaaaacaaa 
cttcatctgg cataatagaa atagtgccca 
5 agacccgaaa aagaaaaaga aaattgttag 
caacatttat gaccatttaa tccaatgtcc 
ctgcagttta ctttcttgta atgaagcata 
gttctaataa gacgaagggt ggagtgcagg 
tttaatcctc ctgcccaccc cttggattct 

10 gaccatctcc ccaaatgcac tgaagggaaa 
taccagcgga ggcacatttg ctgagccccc 
ctggaagcct gatcccaacc tcccctgcaa 
agaaactgag ccttgcaggg gtggagtccc 
agtaggaaga ggaagcacct aggtttgagg 

15 ctcccctgcc ttgctccaca cctggtcagg 
acctaactga aaacaaacaa gctgggagaa 
cagaggttga ggtggctgcg ggactggaag 
aagtgaacag ctggactcgg gctgcctggg 
agcagaggag cgaggcccca gaggagccct 

20 ctttcctgga ctcggcttcc acaggccctg 
ctgtgcctgg cagcagcccc acctgtgtga 
aggagaaggg agcggcctag gggaggccag 
gagtggctgg gacgggagga aaaagagaga 
agacaaattg ccagagatgc agtcagagag 

25 ttagagaaag ggccacacag agccagacag 
gaggacagag aaaggcagac agacacatag 
gaattactga atgacaggga atgacacata 
agggaaagga aggctgcaga cagacagaca 
aggggcttgg agaggtggca caggcaggca 

30 cctcacacac accccgcccc ggggcattaa 
ccgcccaggg ccgcccccct gccagcccgc 
cccagcctac ctgctgtagc tgccgccact 
ccccagcccc agagcctgtg agtccaggag 
gtcagccctc cccaaggaca cctgtcccac 

35 cttctctgct tgggtggggg ttcctggcct 
ccgcagcctc agttacctct aatctccatg 
cacccccgct ggccagggca gcggagggca 
ctttgcttgg ctgtccttca aaagggcagg 
gtgatggggg tcccagacat tgtctggggc 

40 aaaggctcca tatcgctctg ctgcgaagac 
ctggttcccc ttgtccttcc accccaagct 
aggaaaacat cctctgacat gtgccgggga 
ccatggcttg acagctcaaa gcccctccca 
aagcttctag ctgtgaccag gccctctcca 

45 aagtgtttgg agatctgagt gctgtgagaa 
aagtggggag cgggagcagg tgagggagag 
cgatttcccc gtaaagtgat gcggccccat 
cccgctcctc ccccctccag gccatgagga 
caggtacgca ggggatgggg gcagggcagg 

50 aaccctctgt gtctggacag tgacagggct 
acctgaactg gagcagtggt catgagggcc 
gccaaaggag gagaggtcaa gggggtcgta 
gagttggggg agcagtcact caaagcgccc 
aagtatttcc tttttttttt ttcccagaga 

55 tgccgtggtg ccaacacggc tcactgcagt 
gccatctcag cttccccggt agctgggacc 
gtttaattgt ttgtagagat gggggaggag 
aactcctggg ctcaagtaat cctcccacct 
tgagccactg catttgacct tatggaagta 



acttgggaag ctgaggtggg aggatcaccc 1441 
gagatcatgc cactgcactc cagcctgggc 1501 
aacccagcaa tataaataag acacatgttt 1561 
gagcttataa gcttttcaag agtccacaaa 1621 
ctccaaaata ccagatgaaa gctgcaaagt 1681 
ataaaacgta gcattctttc cactagccaa 1741 
cattgtatct ttaatgtggg acgtggcttt 1801 
cttggaaagc aggagagctc agcctacgtc 1861 
gtctccactg ggactcaaga ggtgaggaga 1921 
ctggaggagg gagggagtga ggggtgatca 1981 
ccgcagtctg ctctttccaa gtggaccctc 2041 
gcaggtctgt cacccccatc tctcagatga 2101 
ttgtccccac gtcataaggg tagtcatagt 2161 
ccagggctgg ctgctgtcag aacctaggcc 2221 
ggagagaggg gaggaaagcc aagggaaggg 2281 
gcaggaatct gcgctcgggt tccgcagatg 2341 
tcatcgggca gaggtctcac agcagccagt 2401 
cggcagggag aagcgggcag gggaagggtc 24 61 
ggggtggagc acagccaagg gctctgttcc 2521 
acctgcctcc cccaccctcc ggtcctgccc 2581 
catcccagca caccccccct ctccttgcaa 2641 
gggcccacct gggctggggc tgtggagagg 2701 
cggagattag atggaagaag agggatttca 2761 
actgactgag agacacaaag atagaaggaa 2821 
agagagaaga gtggagatgg agacagggac 2881 
ggacagaaag agaaaaatca cacaaagtca 2941 
gaacgagaca cagattcaga gactcagggc 3001 
gacagaggga ggctgagaca cagggagaag 3061 
gccagtgcct cagaggcctc cggggagggc 3121 
ggcagggctt ggaggccagt catcctgggc 3181 
ctgcctggtg cctggcacct ggcgctccaa 3241 
gccgtctccg ccgccactgg gcccccagag 3301 
gaaagggaag ctgcccctcc ccgtccaggt 3361 
tcgggcaccc atttctccct ctgctctgtc 3421 
ctctctacac ctctcacctc cgatggctgt 3481 
gcttcagctg ctgagctggc cctctgctcc 3541 
ctggccctcc cctcgaccag ccccgcccag 3601 
ggtttggcgg acagggcttc agcaagccgg 3661 
tgagccccct actcccctcc agcagacctc 3721 
aatgaaaaag gggtggctac ggaacggtgt 3781 
gctggggcct ggccagctct caaggcaaga 3841 
ggtcccatgg ctgacttgaa cagggccgaa 3901 
acgacttcca catggttctt ggtatctcgg 3961 
aggccaccct agacacctaa gatatatttt 4021 
acaggggatt tccccaacct tgtttctccc 4081 
aggagagggc atgagccagc ccccccctcc 4141 
gtccctcctt gttcccagag gaacctgggg 4201 
ttctgcagtt aatcctgctt gctctggcaa 4261 
atcctccctc ttgaatctct gggatcccct 4321 
gattccaaat tacagaacaa cccataaggc 4381 
tggatgccct tctagataat ccctttaaat 4441 
aagggtcccg tggaggggct gaggaagctg 4501 
aggacagggg ctactgacca accagtatgg 4561 
caaagtcttg ctctattgtc caggctggag 4 621 
cttgacttcc cgggcttaag tgatccttaa 4 681 
acaggcacct gccaccaagc caggctaatt 4741 
gtctcactat gtttgcctag gctgatctag 4801 
tagcctctca aagtgctggg attacaggca 4861 
ttttcatcct ttaatacccg accccagcat 4921 
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ccagggcaac ccagagggac accagaccag 
ccccaccccc atttctggga gtcctcctgg 
tttgctctca ccccctccag ggcttgtagg 
gtgcaagcct cactcccagc cctggcaggc 
5 tggggcgacg ctcatcgccc ccagatggct 
ggtgcggggg ctggggcggt gccggggtgg 
aagctcaggg ataggggtgc tggtaagggg 
ggttgatggg ctcgagttgg tattgaaggt 
tggctgggaa gggggcttcg gtgggagacg 

10 ttcatcctca aaggtgtcac tcacctctcc 
caactactgt ctctcccacc tcagccgcta 
gaaggaggag ggctgtgagc agacccggac 
caacaacagc ctccccaaca aagaccaccg 
gccagtctcc atcacctggg ctgtgcgacc 

15 tggcaccagc tgcctcattt ccggctgggg 
cagaggggaa cctggcaggg ggtggtgagg 
gggcatcaga gatgcggttc acagtgacga 
acgtcaggat aggggggtgg ggacaaaagt 
gcaatcatac atccataacc tcctggttgt 

20 gaatcttgat tttcttctct ataaaatgag 
tagagataat gtatatcaag caactgacat 
cgtggctcac gcctgtaatc ccagcacttt 
tcaggacttt gataccagcc tggccaacgt 
aaattagttg ggcgtggttg tgtgcgcctg 

25 gagaatcgct taaacttggg agacggaggt 
ccagcctggg caacagagca agactctgtc 
attgctgttg ctattgttac aagaagagag 
cccccagggg cgggatcaca gcaagcactg 
cacagcccct cacgctgttt ccacagtacg 

30 caccatcatt gagcaccaga agtgtgagaa 
ggtgtgtgcc agcgtgcagg aagggggcaa 
ccacagcccc atccccatcc ccagcttcaa 
ccccaacctc aacccgccga cccctgcaac 
tctgacctca gcacaaactt cagctccatc 

35 atccccaaac tcgtttttga gcctaacccc 
atcgctaaac ctatcacctt tcccagtgcc 
caccgtcccc acctcctccc tggctaacac 
tctctccccg tgcccagggt gactccgggg 
gcattatctc ctggggccag gatccgtgtg 

40 aagtctgcaa atatgtggac tggatccagg 
cccaccacag cccatcaccc tccatttcca 
ataagaaacc ctaagccaag accctctacg 
gatgctgtca cttaataatc aacctggggt 
tgccttgaaa tattgtgact ctgggaatga 

45 cagccccaaa gacagctcct ggccatatat 
tgaatctact gagtgcttac tatgtgctag 
attttttgac agagtctcgc tctgtcaccc 
cactgcaacc tccacctcct gggttcaagc 
gggattacag gtgcctacca ccacatccgg 

50 gcttcaccat gttggccagg ctggtctcga 
ggcctcccaa actcctggga ttacagacgt 
ttaattaaaa gaaattaaat taattaatct 
ggctggagtg cagtaacaat cacagctcac 
tgtcctccct cagcctccag agtagctggg 

55 atttttgtat ttttcgtaga gacagaggtc 
cctgggctca agcagtctgt cctcctcagc 
tcgctgtgcc tggcctccaa gcactttcaa 
tgaggtcggt actgttttca tacctatttt 
agtcacttgc tcacagtcac gtggctagga 



ggcccagacc acccactctc tttctctcct 4981 
tctaccacct ctccttcctg agccccttct 5041 
gggagagacc aggatcatca aggggttcga 5101 
agccctgttc gagaagacgc ggctactctg 5161 
cctgacagca gcccactgcc tcaagccgtg 5221 
ggggctggga atggggagat ggatggagag 5281 
attagagatg gggatgggta gtgtcagcaa 5341 
ggggggatga atggggttgg gatggggcta 5401 
tggaagaggt tggaagcaga gcgatgtttc 5461 
cacccatgtc tcccccgacc tttcctcctc 5521 
catagttcac ctggggcagc acaacctcca 5581 
agccactgag tccttccccc accccggctt 5641 
caatgacatc atgctggtga agatggcatc 5701 
cctcaccctc tcctcacgct gtgtcactgc 5761 
cagcacgtcc agcccccagt gtaggagcac 5821 
agggagtggt caggattgtg gaagggttca 5881 
tgtgggataa gttgagagga tgtgtggaaa 5941 
tggggccttg gagtcagacg gacgggatat 6001 
aagaccttag gcaagcagct tcacctctct 6061 
aatgattata cccacctgtc aggattggat 6121 
aaatcattta ttggatagca ggctgggcac 6181 
gggaggccga ggtgggaaga tcacctgagg 6241 
ggtgaaatcc catctctact aaaaatgtga 6301 
taatcccagc tactcgggag gttgaggcag 6361 
tgcagtgagc caagatcacg ccactgcact 6421 
tcgaaaaaaa aaaaaaaaaa gctggatagc 6481 
gtgagttggc tgcgtctaag gacagggatt 6541 
cattagggga ggtggcaggg ggctcattcc 6601 
cctgcctcac accttgcgat gcgccaacat 6661 
cgcctacccc ggcaacatca cagacaccat 6721 
ggactcctgc caggtcagtg tggtctccaa 6781 
tgacatcttt accgacatcc acaatttcat 6841 
tcccaatcca tctcttcccc tgttcccgtt 6901 
cccgtttcca caccatttcc agctccaacc 6961 
atcctttatc ccacccataa tcccagcttt 7021 
tacccatcct gtctcggccc cactcctaag 7081 
catgctcaac gctttctctg accgacattc 7141 
gccctctggt ctgtaaccag tctcttcaag 7201 
cgatcacccg aaagcctggt gtctacacga 7261 
agacgatgaa gaacaattag actggaccca 7321 
cttggtgttt ggttcctgtt cactctgtta 7381 
aacattcttt gggcctcctg gactacagga 7441 
tcgaaatcag tgagacctgg attcaaattc 7501 
caacacctgg tttgttctct gttgtatccc 7561 
caaggtttca ataaatattt gctaaatgag 7621 
accctgatcc aatggctttt attttatttt 7681 
aggctggagt acagtggtgc tatctctgct 7741 
aattctcctg cctcagcctc ctgaatagct 7801 
ctaatttttg tattttttag tagagatggg 7861 
actcctgacc tcagatgatc tgccctcctt 7921 
gagccaccgc gcccgcccgg ctttcattta 7981 
atttaggaga cagtcttgct ctgttgccca 8041 
ggcaatctca atttcctggg gtcaagtgat 8101 
actacaggca catgccacga agcccagcta 8161 
tcagtatgtt gccccggcta gtctcaaact 8221 
ctccaaaagt ggtgagatta caggcatgag 8281 
atgtatcaac ttaatcctca caaaaccctg 8341 
atagttgaag aaacagacac agagaagcaa 8401 
gagcaaggat ctgaagcaag gcgatctctt 84 61 
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aattaccaag tgatgttcct ggagtaaggc 
catgcaaaag tataacacag taagtaaaga 
accaaaggag cttattgttt ttctccaact 
tctgctggcc aatatggtag ccactaaccg 
5 agccagaaat gtaactcctc tgtctcactt 
tgactggtgg tggctggatt agctagtgca 
agcagcactg agttagatgg cctctgaaga 
ggatcttttt tttttttttt ctttgcgaca 
cagtggcgtg atctcggctc actgcaactt 
10 ctcagcctcc ctagtagctg ggactacagg 
gtctttttag tagagatggg gtttcactat 



tctgtttgtt tcctttcctg taaaatgctg 8521 
agtcagttag cctgcacata ctaagaccta 8581 
tccatgatag gtaattagat agtggagacc 8641 
cagctggctc ttccaattaa aattacataa 8701 
gttatatctc caaggctgga tagccacatg 8761 
tataaaacat cactgcagaa agttcagctg 8821 
ggatgtccca cggagagaat ccagaactca 8881 
gagtcttgct ctgtcaccca ggctggagtg 8941 
ctgcctccca ggttcaagca attctcctgc 9001 
cctgtgccaa catccccagc taatttttgt 9061 
gttggccagg ctggtctcga actcctgacc 



